THERMOSTABLE REVERSE TRANSCRIPTASES AND USES THEREOF 

CROSS-REFERENCE TO RELATED APPLICATIONS 

[0001] The present application claims the benefit of United States provisional 
patent application number 60/410,283, filed September 13, 2002, the entire 
disclosure of which is incorporated herein by reference. 

BACKGROUND OF THE INVENTION 
Field of the Invention 

[0002] The present invention is in the fields of molecular and cellular biology. 

The invention is generally related to reverse transcriptase enzymes and 
methods for the reverse transcription of nucleic acid molecules, especially 
messenger RNA molecules. Specifically, the invention relates to reverse 
transcriptase enzymes which have been mutated or modified to increase 
thermostability, decrease terminal deoxynucleotidyl transferase activity, 
and/or increase fidelity, and to methods of producing, amplifying or 
sequencing nucleic acid molecules (particularly cDNA molecules) using these 
reverse transcriptase enzymes or compositions. The invention also relates to 
nucleic acid molecules produced by these methods and to the use of such 
nucleic acid molecules to produce desired polypeptides. The invention also 
relates to nucleic acid molecules encoding the reverse transcriptases of the 
invention, to vectors containing such nucleic acid molecules, and to host cells 
containing such nucleic acid molecules. The invention also concerns kits or 
compositions comprising such enzymes. 

Related Art 

cDNA and cDNA Libraries 
[0003] In examining the structure and physiology of an organism, tissue or 

cell, it is often desirable to determine its genetic content. The genetic 
framework of an organism is encoded in the double-stranded sequence of 
nucleotide bases in the deoxyribonucleic acid (DNA) which is contained in the 
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somatic and germ cells of the organism. The genetic content of a particular 
segment of DNA, or gene, is typically manifested upon production of the 
protein which the gene encodes. In order to produce a protein, a 
complementary copy of one strand of the DNA double helix is produced by 
RNA polymerase enzymes, resulting in a specific sequence of ribonucleic acid 
(RNA). This particular type of RNA, since it contains the genetic message 
from the DNA for production of a protein, is called messenger RNA (mRNA). 

[0004] Within a given cell, tissue or organism, there exist myriad mRNA 

species, each encoding a separate and specific protein. This fact provides a 
powerful tool to investigators interested in studying genetic expression in a 
tissue or cell. mRNA molecules may be isolated and further manipulated by 
various molecular biological techniques, thereby allowing the elucidation of 
the full functional genetic content of a cell, tissue or organism. 

[0005] One common approach to the study of gene expression is the 

production of complementary DNA (cDNA) clones. In this technique, the 
mRNA molecules from an organism are isolated from an extract of the cells or 
tissues of the organism. This isolation often employs solid chromatography 
matrices, such as cellulose or agarose, to which oligomers of thymidine (T) 
have been complexed. Since the 3' termini on most eukaryotic mRNA 
molecules contain a string of adenosine (A) bases, and since A base pairs with 
T, the mRNA molecules can be rapidly purified from other molecules and 
substances in the tissue or cell extract. From these purified mRNA molecules, 
cDNA copies may be made using the enzyme reverse transcriptase (RT), 
which results in the production of single-stranded cDNA molecules. This 
reaction is typically referred to as the first strand reaction. The single-stranded 
cDNAs may then be converted into a complete double-stranded DNA copy 
(i.e., a double-stranded cDNA) of the original mRNA (and thus of the original 
double-stranded DNA sequence, encoding this mRNA, contained in the 
genome of the organism) by the action of a DNA polymerase. The protein- 
specific double-stranded cDNAs can then be inserted into a plasmid or viral 
vector, which is then introduced into a host bacterial, yeast, animal or plant 
cell. The host cells are then grown in culture media, resulting in a population 
of host cells containing (or in many cases, expressing) the gene of interest. 



-3- 

[0006] This entire process, from isolation of mRNA from a source organism 

or tissue to insertion of the cDNA into a plasmid or vector to growth of host 
cell populations containing the isolated gene, is termed "cDNA cloning." The 
set of cDNAs prepared from a given source of mRNAs is called a "cDNA 
library." The cDNA clones in a cDNA library correspond to the genes 
transcribed in the source tissue. Analysis of a cDNA library can yield much 
information on the pattern of gene expression in the organism or tissue from 
which it was derived. 

Retroviral Reverse Transcriptase Enzymes 
[0007] Three prototypical forms of retroviral reverse transcriptase have been 

studied thoroughly. Moloney Murine Leukemia Virus (M-MLV) reverse 
transcriptase contains a single subunit of 78 kDa with RNA-dependent DNA 
polymerase and RNase H activity. This enzyme has been cloned and 
expressed in a fully active form in E. coli (reviewed in Prasad, V.R., Reverse 
Transcriptase, Cold Spring Harbor, New York: Cold Spring Harbor 
Laboratory Press, p. 135 (1993)). Human Immunodeficiency Virus (HIV) 
reverse transcriptase is a heterodimer of p66 and p51 subunits in which the 
smaller subunit is derived from the larger by proteolytic cleavage. The p66 
subunit has both a RNA-dependent DNA polymerase and an RNase H domain, 
while the p51 subunit has only a DNA polymerase domain. Active HIV 
p66/p51 reverse transcriptase has been cloned and expressed successfully in a 
number of expression hosts, including E. coli (reviewed in Le Grice, S.F.J., 
Reverse Transcriptase, Cold Spring Harbor, New York: Cold Spring Harbor 
Laboratory press, p. 163 (1993)). Within the HIV p66/p51 heterodimer, the 
5 1 -kD subunit is catalytically inactive, and the 66-kD subunit has both DNA 
polymerase and RNase H activity (Le Grice, S.F.J., et al, EMBO Journal 
70:3905 (1991); Hostomsky, Z., et al, J, Virol 66:3179 (1992)). Avian 
Sarcoma-Leukosis Virus (ASLV) reverse transcriptase, which includes but is 
not limited to Rous Sarcoma Virus (RSV) reverse transcriptase, Avian 
Myeloblastosis Virus (AMV) reverse transcriptase, Avian Erythroblastosis 
Virus (AEV) Helper Virus MCAV reverse transcriptase, Avian 
Myelocytomatosis Virus MC29 Helper Virus MCAV reverse transcriptase, 
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Avian Reticuloendotheliosis Virus (REV-T) Helper Virus REV-A reverse 
transcriptase, Avian Sarcoma Virus UR2 Helper Virus UR2AV reverse 
transcriptase, Avian Sarcoma Virus Y73 Helper Virus YAV reverse 
transcriptase, Rous Associated Virus (RAV) reverse transcriptase, and 
Myeloblastosis Associated Virus (MAV) reverse transcriptase, is also a 
heterodimer of two subunits, a (approximately 62 kDa) and (3 (approximately 
94 kDa), in which a is derived from p by proteolytic cleavage (reviewed in 
Prasad, V.R., Reverse Transcriptase, Cold Spring Harbor, New York: Cold 
Spring Harbor Laboratory Press (1993), p. 135). ASLV reverse transcriptase 
can exist in two additional catalytically active structural forms, pp and a 
(Hizi, A. and Joklik, W.K., J. Biol. Chem. 252: 2281 (1977)). Sedimentation 
analysis suggests ap and pp are dimers and that the a form exists in an 
equilibrium between monomelic and dimeric forms (Grandgenett, D.P., et al. 9 
Proc. Nat Acad. Sci. USA 70:230 (1973); Hizi, A. and Joklik, W.K., J. Biol 
Chem. 252:2281 (1977); and Soltis, D.A. and Skalka, A.M., Proc. Nat Acad. 
Sci. USA 55:3372 (1988)). The ASLV ap and PP reverse transcriptases are 
\ the only known examples of retroviral reverse transcriptase that include three 
different activities in the same protein complex: DNA polymerase, RNase H, 
and DNA endonuclease (integrase) activities (reviewed in Skalka, A.M., 
Reverse Transcriptase, Cold Spring Harbor, New York: Cold Spring Harbor 
Laboratory Press (1993), p. 193). The a form lacks the integrase domain and 
activity. 

[0008] Various forms of the individual subunits of ASLV reverse transcriptase 

have been cloned and expressed. These include a 98-kDa precursor 
polypeptide that is normally processed proteolytically to p and a 4 kDa 
polypeptide removed from the p carboxy end (Alexander, F., et al., J. Virol. 
61:534 (1987) and Anderson, D. et al, Focus 77:53 (1995)), and the mature P 
subunit (Weis, J.H. and Salstrom, J.S., U.S. Patent No. 4,663,290 (1987); and 
Soltis, D.A. and Skalka, A.M., Proc. Nat Acad. Sci. USA 85:3312 (1988)). 
{See also Werner S. and Wohrl B.M., Eur. J. Biochem. 267:4740-4144 (2000); 
Werner S. and Wohrl B.M., J. Virol. 74:3245-3252 (2000); Werner S. and 
Wohrl B.M., J. Biol. Chem. 274:26329-26336 (1999).) Heterodimeric RSV 
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ap reverse transcriptase has also been purified from E. coli cells expressing a 
cloned RSV p gene (Chernov, A.R, et al, Biomed. ScL 2:49 (1991)). 

Reverse Transcription Efficiency 

[0009] As noted above, the conversion of mRNA into cDNA by reverse 

transcriptase-mediated reverse transcription is an essential step in the study of 
proteins expressed from cloned genes. However, the use of unmodified 
reverse transcriptase to catalyze reverse transcription is inefficient for a 
number of reasons. First, reverse transcriptase sometimes degrades an RNA 
template before the first strand reaction is initiated or completed, primarily due 
to the intrinsic RNase H activity present in reverse transcriptase. In addition, 
mis-priming of the mRNA template molecule can lead to the introduction of 
errors in the cDNA first strand while secondary structure of the mRNA 
molecule itself may make some mRNAs refractory to first strand synthesis. 

[0010] Removal of the RNase H activity of reverse transcriptase can eliminate 

the first problem and improve the efficiency of reverse transcription (Gerard, 
G.F., et al., FOCUS U(4):60 (1989); Gerard, G.F., et al, FOCUS 14(3):9\ 
(1992)). However such reverse transcriptases ("RNase H-" forms) do not 
address the additional problems of mis-priming and mRNA secondary 
structure. 

[0011] Another factor which influences the efficiency of reverse transcription 

is the ability of RNA to form secondary structures. Such secondary structures 
can form, for example, when regions of RNA molecules have sufficient 
complementarity to hybridize and form double stranded RNA. Generally, the 
formation of RNA secondary structures can be reduced by raising the 
temperature of solutions which contain the RNA molecules. Thus, in many 
instances, it is desirable to reverse transcribe RNA at temperatures above 
37°C. However, art known reverse transcriptases generally lose activity when 
incubated at temperatures much above 37°C {e.g., 50°C). 

SUMMARY OF THE INVENTION 

[0012] The present invention provides, in part, reverse transcriptase enzymes, 

compositions comprising such enzymes and methods useful in overcoming 
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limitations of reverse transcription discussed above. In general, the invention 
provides compositions for use in reverse transcription of a nucleic acid 
molecule, these compositions comprising one or more (e.g., one, two, three, 
four, five, ten, fifteen, etc.) polypeptides having at least one reverse 
transcriptase activity. Such compositions may further comprise one or more 
(e.g., one, two, three, four, five, etc.) nucleotides (e.g. one or more 
fluorescentl-labeled nucleotides, one or more radiolabeled nucleotides, etc.), a 
suitable buffer, and/or one or more (e.g., one, two, three, four, five, ten, 
fifteen, etc.) DNA polymerases. Compositions of the invention may also 
comprise one or more (e.g., one, two, three, four, five, ten, fifteen, etc.) 
oligonucleotide primers, and/or one or more templates, and/or one or more 
nucleic acid molecules (which may be complementary to all or a portion of 
such templates). 

[0013] Reverse transcriptases of the invention are preferably modified or 

mutated such that the thermostability of the enzyme is increased or enhanced 
and/or the fidelity of the enzyme is increased or enhanced. In specific 
embodiments, reverse transcriptases of the invention may be single chained 
(single subunit) or multi-chained (multi-subunit) and may be reduced or 
substantially reduced in RNase H activity or may have no detectable RNase H 
activity or may be lacking in RNase H activity. Preferably enzymes of the 
invention are enzymes selected from the group consisting of Moloney Murine 
Leukemia Virus (M-MLV) RNase H- reverse transcriptase, Rous Sarcoma 
Virus (RSV) RNase H- reverse transcriptase, Avian Myeloblastosis Virus 
(AMV) RNase H- reverse transcriptase, Rous Associated Virus (RAV) RNase 
H- reverse transcriptase, Myeloblastosis Associated Virus (MAV) RNase H- 
reverse transcriptase or other ASLV RNase H- reverse transcriptases and 
Human Immunodeficiency Virus (HIV) RNase H- reverse transcriptase and 
mutants thereof. In preferred compositions, the reverse transcriptases are 
present at working concentrations. 

[0014] In certain aspects, the invention includes reverse transcriptases which 

have been modified or mutated to increase or enhance thermostability. 
Examples of such reverse transcriptases include enzymes comprising one or 
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more modifications or mutations at positions corresponding to amino acids 
selected from the group consisting of: 

(a) leucine 52 of M-MLV reverse transcriptase; 

(b) tyrosine 64 of M-MLV reverse transcriptase; 

(c) lysine 1 52 of M-MLV reverse transcriptase; 

(d) histidine 204 of M-MLV reverse transcriptase; 

(e) methionine 289 of M-MLV reverse transcriptase; 

(f) threonine 306 of M-MLV reverse transcriptase; and 

(g) phenylalanine 309 of M-MLV reverse transcriptase. 

In some embodiments, a modification or mutation may be the addition of an 
N- and/or C-terminal tag sequence. 

[0015] In specific embodiments, the invention is directed to M-MLV reverse 

transcriptases wherein leucine 52 is replaced with proline, tyrosine 64 is 
replaced with arginine, lysine 152 is replaced with methionine, histidine 204 is 
replaced with arginine, methionine 289 is replaced with leucine, threonine 306 
is replaced with either lysine or arginine, and/or phenylalanine 309 is replaced 
with asparagine or serine. Further included within the scope of the invention 
are reverse transcriptases, other than M-MLV reverse transcriptase, which 
contain alterations corresponding to those set out above. 

[0016] In additional aspects, the invention also include thermostable reverse 

transcriptases which retain at least about 50%, at least about 60%, at least 
about 70%, at least about 85%, at least about 95%, at least about 97%, at least 
about 98%, at least about 99%, at least about 100%, at least about 150%, at 
least about 200%, at least about 250%, or at least about 300% of reverse 
transcriptase activity after heating to 50°C for 5 minutes. 

[0017] As noted above, enzymes of the invention include reverse 

transcriptases which exhibit reverse transcriptase activity either upon the 
formation of multimers (e.g., dimers) or as individual protein molecules (i.e., 
in monomeric form). Examples of reverse transcriptases which exhibit reverse 
transcriptase activity upon the formation of multimers include AMV, RSV and 
HIV reverse transcriptases. One example of a reverse transcriptase which 
exhibits reverse transcriptase activity as separate, individual proteins (i.e., in 
monomeric form) is M-MLV reverse transcriptase. 
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[0018] Multimeric reverse transcriptases of the invention may form 

homo-multimers or hetero-multimers. In other words, the subunits of the 
multimeric protein complex may be identical or different. One example of a 
hetero-dimeric reverse transcriptase is AMV reverse transcriptase, which is 
composed of two subunits that differ in primary amino acid sequence. More 
specifically, as already discussed, AMV reverse transcriptase may be 
composed of two subunits wherein one of these subunits is generated by 
proteolytic processing of the other. Thus, dimeric AMV reverse transcriptase 
may be composed of subunits of differing size which share regions of amino 
acid sequence identity. 

[0019] The present invention relates in particular to mutant or modified 

reverse transcriptases wherein one or more (e.g., one, two, three,, four, five, 
ten, twelve, fifteen, twenty, etc.) amino acid changes have been made which 
renders the enzyme more thermostable in nucleic acid synthesis, as compared 
to the unmutated or unmodified reverse transcriptases. Sites for mutation or 
modification to produce the thermostable reverse transcriptase enzymes of the 
present invention and/or reverse transcriptases which exhibit other 
characteristics (eg., increased fidelity, decreased TdT activity, etc.) are listed 
for some reverse transcriptases in Table 1 . As will be appreciated by those 
skilled in the art, one or more of the amino acids identified may be deleted 
and/or replaced with one or a number of amino acid residues. In a preferred 
aspect, any one or more of the amino acids identified in Table 1 may be 
substituted with any one or more amino acid residues such as Ala, Arg, Asn, 
Asp, Cys, Gin, Glu, Gly, His, He, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr, 
and/or Val. The modifications described in Table 1 preferably produce 
thermostable reverse transcriptases of the invention. Similar or equivalent 
sites or corresponding sites in other reverse transcriptases can be mutated or 
modified to produce additional thermostable reverse transcriptases, as well as 
reverse transcriptases which exhibit other characteristics (e.g., increased 
fidelity, decreased TdT activity, etc.). Thus, a reverse transcriptase of the 
present invention may have one or more of the following properties: (a) 
increased thermostability or increased half life at elevated temperatures; (b) 
reduced, substantially reduced, or no detectable RNase H activity, (c) reduced 
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or substantially reduced terminal deoxynucleotidyl transferase activity, and/or 
(d) increased fidelity. In some embodiments, a reverse transcriptase of the 
invention may have a plurality of the properties listed above {e.g., a reverse 
transcriptase may have enhanced thermostability, reduced RNase H activity, 
and enhanced fidelity). 



Table 1 



RT 


Amino Acids 


M-MLV 


L52, Y64, L135, H143, K152, Q165, G181, H204, 1218, N249, M289, 
T306, F309, A517, D524, T544, V546, W548, E562, H577, D583, 
L604, S606, G608, F625, L626, H629, H631, H638, G641 


AMV 


V2, L4, W12, P14, H16, T17, W20, 121, Q23, W24, L26, P27, G29, 
V32, Q36, L42, Q43, L44, G45, H46, 147, P49, S50, L51, S52, C53, 
W54, F59, 161, A64, S65, G66, S67, Y68, L70.L71, A76, A79, P83, 
A86, V87, Q88, Q89, G90, A91, W101, P102, L108, Q120, S131, 
V132, N133, N134, Q135, P137, A138, Q142, Q148, T151, Y180, 
M181, S190, H191, G193, A196, 1201, S202, P214, V217, Q218, P221, 
G222, Q224, L226, G227, Y228, G231, T233, Y234, A236, P237, 
G239, L240, P244, 1246, T248, W250, Q252, G257, Q260, W261, 
P264, L266, G267, L272, Y277, Q279, L280, G282, S283, P285, N286, 
A288, N292, L293, M297, 1302, V303, L305, S306, T308, L311, L320, 
1332, G333, V334, G336, Q337, G338, P345, W348, L349, F350, S351, 
P354, A357, F358, A360, W361, L362, V364, L365, T366, T370, A374, 
V377, G381, C392, P400, G402, L405, G412, 1414, F423, 1425, A426, 
P428, L433, H440, P441, V443, G444, P445, A451, S453, S454, T455, 
H456, G458, V459, V460, W462, W468, 1470, 1473, A474, L476, 
G477, A478, S479, V480, Q481, Q482, L483, A491, W495, P496, 
T497, T498, P499, T500, A507, F508, M512, L513, G520, V521, P522, 
S523, T524, A525, A527, F528, L534, S535, Q536, S538, V543, S548, 
H549, S550, V552, P553, F556, T557, N560, A562 
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Table 1 



RT 


Amino Acids 


RSV 


V2, IA, W12, P14, H16, T17, W20, 121, Q23, W24, L26, P27, G29, 
V32, Q36, L42, Q43, L44, G45, H46, 147, P49, S50, L51, S52, C53, 
W54, F59, 161, A64, S65, G66, S67, Y68, L70, L71, A76, A79, P83, 
A86, V87, Q88, Q89, G90, A91, W101, P102, L108, Q120, S131, 
V132, N133, N134, Q135, P137, A138, Q142, Q148, T151, Y180, 
M181, S190, H191, G193, A196, 1201, S202, P214, V217, Q218, P221, 
G222, Q224, L226, G227, Y228, G231, T233, Y234, A236, P237, 
G239, L240, P244, 1246, T248, W250, Q252, G257, Q260, W261, 
P264, L266, G267, L272, Y277, Q279, L280, G282, S283, P285, N286, 
A288, N292, L293, M297, 1302, V303, L305, S306, T308, L311, L320, 
1332, G333, V334, G336, Q337, G338, P345, W348, L349, F350, S351, 
P354, A357, F358, A360, W361, L362, V364, L365, T366, T370, A374, 
V377, G381, C392, P400, G402, L405, G412, 1414, F423, 1425, A426, 
P428, L433, H440, P441, V443, G444, P445, A451, S453, S454, T455, 
H456, G458, V459, V460, W462, W468, 1470, 1473, A474, L476, 
G477, A478, S479, V480, Q481, Q482, L483, A491, W495, P496, 
T497, T498, P499, T500, A507, F508, M512, L513, G520, V521, P522, 
S523, T524, A525, A527, F528, L534, S535, Q536, S538, V543, S548, 
H549, S550, V552, P553, F556, T557, N560, A562 


HIV 


11, P3, Lll, P13, G14, M15, Q22, W23, L25, T26, T38, G44, 146, S47, 
G50, P51, N53, P54, Y55, F60, 162, S67, T68, W70, L73, V89, 
Q90L91, G92, 193, S104, V110, Gill, S133, 1134, N135, N136, P139, 
G140, 1141, Q144, N146, Q150„ Y182, M183, 1194, G195, Q196, 
T,199, Q206, L209, P216, Q221, P224, P225, L227, M229, G230, 
Y231, H234, Q241, P242, V244, L245, S250, T252, N254, Q257, G261, 
N264, W265, Q268, P271, G272, Q277, C279, L281, L282, G284, 
T285, A287, L288, T289, V291, P293, L294, T295, L300, A303, 1308, 
L309, P312, H314, Y317, L324, 1328, Q329, G332, Q333, G334, Y341, 
P344, F345, Y353, M356, G358, A359, H360, T361, Q372, T376, 
V380, Q392, W405, Q406, A407, F415, V416, N417, T418, P419, 
P420, L424, W425, P432, V434, G435, A436, A444, A445, N446, 
T449, L451, N459, G461, Q463, V465, V466, P467, L468, T469, N470, 
T471, T472, N473, Q474, Y482, Q486, S488, G489, L490, Q499, 
Y500, G503, 1504, S512, S514, L516, N518, Q519, Q523, 1525, W534, 
P536, A537, H538, G540, 1541, G542, Q546, L550, S552, A553, V554, 
1555 



[0020] Those skilled in the art will appreciate that a different isolate of virus 

may encode a reverse transcriptase enzyme having a different amino acid at 
the positions identified above. Such isolates may be modified to produce the 
reverse transcriptases (e.g., thermostable reverse transcriptases) of the present 
invention. 

[0021] Reverse transcriptases of the invention may have one or more of the 

following properties: (a) increased thermostability or increased half-life at 



elevated temperatures; (b) reduced, substantially reduced, or no detectable 
RNase H activity, (c) reduced or substantially reduced terminal 
deoxynucleotidyl transferase activity, and/or (d) increased fidelity. 
[0022] Enzymes of the invention which have reduced or substantially reduced 

terminal deoxynucleotidyl transferase activity may comprise one or more 
modifications or mutations at positions corresponding to amino acids selected 
from the group consisting of: 

(a) tyrosine 133 of M-MLV reverse transcriptase; 

(b) threonine 197 of M-MLV reverse transcriptase; and 

(c) phenylalanine 309 of M-MLV reverse transcriptase. 

[0023] In specific embodiments, the invention is directed to M-MLV reverse 

transcriptases wherein tyrosine 133 is replaced with alanine, threonine 197 is 
replaced with glutamic acid, and/or phenylalanine 309 is replaced with 
asparagine. As will be appreciated, one or more of the amino acids identified 
may be deleted and/or replaced with one or a number of amino acid residues. 
Further included within the scope of the invention are reverse transcriptases, 
other than M-MLV reverse transcriptase, which contain alterations 
corresponding to those set out above. 

[0024] Additionally, enzymes which exhibit increased fidelity may comprise 

one or more modifications or mutations at positions corresponding to amino 
acids selected from the group consisting of: 

(a) tyrosine 64 of M-MLV reverse transcriptase; 

(b) arginine 1 1 6 of M-MLV reverse transcriptase; 

(c) glutamine 190 of M-MLV reverse transcriptase; and 

(d) valine 223 of M-MLV reverse transcriptase. 

[0025] As will be appreciated, one or more of the amino acids identified may 

be deleted and/or replaced with any one or a number of amino acid residues. 
Further, included in the invention are reverse transcriptases, other than M- 
MLV reverse transcriptase, that contain alterations corresponding to those set 
out above. 

[0026] In some embodiments, the present invention provides a modified or 

mutated reverse transcriptase (e.g., preferably a modified or mutated retroviral 
reverse transcriptase) having a reverse transcriptase activity that has a half-life 
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of greater than that of the corresponding un-modified or un-mutated reverse 
transcriptase at an elevated temperature, i.e., greater than 37°C. In some 
embodiments, the half-life of a reverse transcriptase of the present invention 
may be 5 minutes or greater and preferably 10 minutes or greater at 50°C. In 
some embodiments, the reverse transcriptases of the invention may have a 
half-life {e.g., at 50°C) equal to or greater than about 25 minutes, preferably 
equal to or greater than about 50 minutes, more preferably equal to or greater 
than about 100 minutes, and most preferably, equal to or greater than about 
200 minutes. 

[0027] In some embodiments, the reverse transcriptases of the invention may 

have a half-life at 50°C that is from about 10 minutes to about 200 minutes, 
from about 10 minutes to about 150 minutes, from about 10 minutes to about 
100 minutes, from about 10 minutes to about 75 minutes, from about 10 
minutes to about 50 minutes, from about 10 minutes to about 40 minutes, from 
about 10 minutes to about 30 minutes, or from about 10 minutes to about 20 
minutes. 

[0028] A modified or mutated reverse transcriptase of the invention (e.g., one 

having a half-life at 50°C as described above) may be a modified or mutated 
retroviral reverse transcriptase. A reverse transcriptase according to the 
invention may be selected from a group consisting of M-MLV reverse 
transcriptase, ASV reverse transcriptase, HIV reverse transcriptase, Avian 
Sarcoma-Leukosis Virus (ASLV) reverse transcriptase, Rous Sarcoma Virus 
(RSV) reverse transcriptase, Avian Myeloblastosis Virus (AMV) reverse 
transcriptase, Avian Erythroblastosis Virus (AEV) Helper Virus MCAV 
reverse transcriptase, Avian Myelocytomatosis Virus MC29 Helper Virus 
MCAV reverse transcriptase, Avian Reticuloendotheliosis Virus (REV-T) 
Helper Virus REV-A reverse transcriptase, Avian Sarcoma Virus UR2 Helper 
Virus UR2AV reverse transcriptase, Avian Sarcoma Virus Y73 Helper Virus 
YAV reverse transcriptase, Rous Associated Virus (RAV) reverse 
transcriptase, and Myeloblastosis Associated Virus (MAV) reverse 
transcriptase, and fragments of any of the above having reverse transcriptase 
activity. 
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Mutated or modified reverse transcriptases of the present invention 
may have a reverse transcriptase activity (e.g., RNA-dependent DNA 
polymerase activity) that has a longer half-life at 55°C than the reverse 
transcriptase activity of a corresponding un-mutated or un-modified reverse 
transcriptases. For example, introduction of the H204R, M289K, T306K, and 
F309N mutation into His 6 -H'RT increases the half-life at 55°C from 1.6 
minutes to 8.1 minutes (see Table 9). At 55°C, the half-life of reverse 
transcriptase activity of a mutated or modified reverse transcriptase of the 
invention may be greater than about 2 minutes, greater than about 3 minutes, 
greater than about 4 minutes, greater than about 5 minutes, greater than about 
6 minutes, greater than about 7 minutes, greater than about 8 minutes, greater 
than about 10 minutes, greater than about 15 minutes, greater than about 20 
minutes, or greater than about 30 minutes. At 55°C, the half-life of reverse 
transcriptase activity of a reverse transcriptase of the invention may be from 
about 2 minutes to about 60 minutes, from about 2 minutes to about 45 
minutes, from about 2 minutes to about 30 minutes, from about 2 minutes to 
about 20 minutes, from about 2 minutes to about 15 minutes, from about 2 
minutes to about 10 minutes, from about 2 minutes to about 8 minutes, from 
about 2 minutes to about 7 minutes, from about 2 minutes to about 6 minutes, 
from about 2 minutes to about 5 minutes, from about 2 minutes to about 4 
minutes, or from about 2 minutes to about 3 minutes. Such a reverse 
transcriptase may be a modified or mutant retroviral reverse transcriptase. 

A modified or mutated reverse transcriptase of the invention (e.g., one 
having a half-life at 55°C as described above) may be a modified or mutated 
retroviral reverse transcriptase. A mutated reverse transcriptase according to 
the present invention may be selected from a group consisting of M-MLV 
reverse transcriptase, ASV reverse transcriptase, HIV reverse transcriptase, 
Avian Sarcoma-Leukosis Virus (ASLV) reverse transcriptase, Rous Sarcoma 
Virus (RSV) reverse transcriptase, Avian Myeloblastosis Virus (AMV) 
reverse transcriptase, Avian Erythroblastosis Virus (AEV) Helper Virus 
MCAV reverse transcriptase, Avian Myelocytomatosis Virus MC29 Helper 
Virus MCAV reverse transcriptase, Avian Reticuloendotheliosis Virus (REV- 
T) Helper Virus REV-A reverse transcriptase, Avian Sarcoma Virus UR2 
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Helper Virus UR2AV reverse transcriptase, Avian Sarcoma Virus Y73 Helper 
Virus YAV reverse transcriptase, Rous Associated Virus (RAV) reverse 
transcriptase, and Myeloblastosis Associated Virus (MAV) reverse 
transcriptase and fragments of any of the above having reverse transcriptase 
activity. 

[0031] Reverse transcriptases of the present invention may produce more 

product (e.g., full length product) at elevated temperatures than other reverse 
transcriptases. In one aspect, comparisons of full length product synthesis is 
made at different temperatures (e.g., one temperature being lower, such as 
between 37°C and 50°C, and one temperature being higher, such as between 
50°C and 78°C) while keeping all other reaction conditions similar or the 
same. The amount of full length product produced may be determined using 
techniques well known in the art, for example, by conducting a reverse 
transcription reaction at a first temperature (e.g., 37°C, 38°C, 39°C, 40°C, etc.) 
and determining the amount of full length transcript produced, conducting a 
second reverse transcription reaction at a temperature higher than the first 
temperature (e.g., 45°C, 50°C, 52.5°C, 55°C, etc.) and determining the amount 
of full length product produced, and comparing the amounts produced at the 
two temperatures. A convenient form of comparison is to determine the 
percentage of the amount of full length product at the first temperature that is 
produced at the second (i.e., elevated) temperature. The reaction conditions 
used for the two reactions (e.g., salt concentration, buffer concentration, pH, 
divalent metal ion concentration, nucleoside triphosphate concentration, 
template concentration, reverse transcriptase concentration, primer 
concentration, length of time the reaction is conducted, etc.) are preferably the 
same for both reactions. Suitable reaction conditions include, but are not 
limited to, a template concentration of from about 1 nM to about 1 \iM, from 
about 100 nM to 1 jiM, from about 300 nM to about 750 nM, or from about 
400 nM to about 600 nM, and a reverse transcriptase concentration of from 
about 1 nM to about 1 jaM, from about 10 nM to 500 nM, from about 50 nM 
to about 250 nM, or from about 75 nM to about 125 nM. The ratio of the 
template concentration to the reverse transcriptase concentration may be from 
about 100:1 to about 1:1, from about 50:1 to about 1:1, from about 25:1 to 
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about 1:1, from about 10:1 to about 1:1, from about 5:l-to about 1:1, or from 
about 2.5:1 to 1:1. A reaction may be conducted from about 5 minutes to 
about 5 hours, from about 10 minutes to about 2.5 hours, from about 30 
minutes to about 2 hours, from about 45 minutes to about 1.5 hours, or from 
about 45 minutes to about 1 hour. A suitable reaction time is about one hour. 
Other suitable reaction conditions may be determined by those skilled in the 
art using routine techniques and examples of such conditions are provided 
below. 

I When the amount of full length product produced by a reverse 

transcriptase of the invention at an elevated temperature is compared to the 
amount of full length product produced by the same reverse transcriptase at a 
lower temperature, at an elevated temperature, the reverse transcriptases of the 
invention may produce not less than about 25%, 35%, 45%, 55%, 65%, 75%, 
85%), 95%, 100% of the amount of full length product produced at the lower 
temperature. In some cases, the reverse transcriptases of the invention may 
produce an amount of full length product at a higher temperature that is 
greater than the amount of full length product produced by the reverse 
transcriptase at a lower temperature (e.g., 1% to about 100% greater). In one 
aspect, reverse transcriptases of the invention produce approximately the same 
amount (e.g., no more than a 25% difference) of full length product at the 
lower temperature compared to the amount of full length product made at the 
higher temperature. 

A reverse transcriptase of the present invention may be one that 
synthesizes an amount of full length product, wherein the amount of full 
length product synthesized at 50°C is no less than 10% (e.g., from about 10% 
to about 95%, from about 10% to about 80%, from about 10% to about 70%, 
from about 10% to about 60%, from about 10% to about 50%, from about 
10% to about 40%, from about 10% to about 30%, or from about 10% to about 
20%) of the amount of full length product it synthesizes at 40°C. In some 
embodiments, a reverse transcriptase of the invention is one wherein the 
amount of full length product synthesized at 50°C is no less than 50% (e.g., 
from about 50% to about 95%, from about 50% to about 80%, from about 
50% to about 70%, or from about 50% to about 60%) of the amount of full 
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length product it synthesizes at 40°C. In some embodiments, a reverse 
transcriptase of the invention is one wherein the amount of full length product 
synthesized at 50°C is no less than 75% {e.g., from about 75% to about 95%, 
from about 75%, to about 90%, from about 75% to about 85%, or from about 
75% to about 80%) of the amount of full length product it synthesizes at 40°C. 
In other embodiments, a reverse transcriptase of the invention is one wherein 
the amount of full length product synthesized at 50°C is no less than 85% 
(e.g., from about 85% to about 95%, or from about 85% to about 90%) of the 
amount of full length product it synthesizes at 40°C. 

[0034] A reverse transcriptase of the invention may be one that synthesizes an 

amount of full length product, wherein the amount of full length product 
synthesized at 52.5°C is no less than 10% (e.g., from about 10% to about 30%, 
from about 10% to about to about 25%, from about 10% to about 20%, from 
about 10% to about 15%, from about 20% to about 60%, from about 20% to 
about 40%, from about 20% to about 30%, from about 30% to about 80%, 
from about 30% to about 60%, from about 30% to about 45%, from about 
40% to about 90%, from about 40% to about 80%, from about 40% to about 
60%, from about 40% to about 50% from about 50% to about 90%, or from 
about 50% to about 70%), of the amount of full length product it synthesizes 
at 40°C. In some embodiments, the amount of full length product synthesized 
at 52.5°C is no less than 30% (e.g., from about 30% to about 70%, from about 
30% to about 60%, from about 30% to about 50%, or from about 30% to about 
40%) of the amount of full length product it synthesizes at 40°C. In some 
embodiments, the amount of full length product synthesized at 52.5°C is no 
less than 50% (e.g., from about 50% to about 70%, from about 50% to about 
65%, from about 50% to about 60%, or from about 50% to about 55%), of the 
amount of full length product it synthesizes at 40°C. 

[0035] A reverse transcriptase of the invention may be one that synthesizes an 

amount of full length product, wherein the amount of full length product 
synthesized at 55°C is no less than 1% (e.g., from about 1% to about 30%, 
from about 1% to about 25%, from about 1% to about 20%, from about 1% to 
about 15%, from about 1% to about 10%, or from about 1% to about 5%) of 
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the amount of full length product it synthesizes at 40°C. In some 
embodiments, the amount of full length product synthesized at 55°C is no less 
than 5% (e.g., from about 5% to about 30%, from about 5% to about to about 
25%, from about 5% to about 20%, from about 5% to about 15%, or from 
about 5% to about 10%) of the amount of full length product it synthesizes at 
40°C. In some embodiments, the amount of full length product synthesized at 
55°C is no less than 10% (e.g., from about 10% to about 30%, from about 
10% to about to about 25%, from about 10% to about 20%, from about 10% to 
about 15%, from about 20% to about 60%, from about 20% to about 40%, 
from about 20% to about 30%, from about 30% to about 80%, from about 
30% to about 60%, from about 30% to about 45%, from about 40% to about 
90%, from about 40% to about 80%, from about 40% to about 60%, from 
about 40% to about 50% from about 50% to about 90%, or from about 50% to 
about 70%) of the amount of full length product it synthesizes at 40°C. 
[0036] In another aspect, the reverse transcriptases of the invention are 

capable of producing more nucleic acid product (e.g., cDNA) and, preferably, 
more full length product, at one or a number of elevated temperatures 
(typically between 40°C an 78°C) compared to the corresponding un-mutated 
or un-modified reverse transcriptase (e.g., the control reverse transcriptase). 
Such comparisons are typically made under similar or the same reaction 
conditions and the amount of product synthesized by the control reverse 
transcriptase is compared to the amount of product synthesized by the reverse 
transcriptase of the invention. Preferably, the reverse transcriptases of the 
invention produce at least about 5%, at least 10%, at least 15%, at least 25%, 
at least 50%, at least 75%, at least 100%, or at least 200% more product or full 
length product compared to the corresponding control reverse transcriptase 
under the same reaction conditions and temperature. The reverse 
transcriptases of the invention preferably produce from about 10% to about 
200%, from about 25% to about 200%, from about 50% to about 200%, from 
about 75% to about 200%, or from about 100% to about 200% more product 
or full length product compared to a control reverse transcriptase under the 
same reaction conditions and incubation temperature. The reverse 
transcriptases of the invention preferably produce at least 2 times, at least 3 
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times, at least 4 times, at least 5 times, at least 6 times, at least 7 times, at least 
8 times, at least 9 times, at least 10 times, at least 25 times, at least 50 times, at 
least 75 times, at least 100 times, at least 150 times, at least 200 times, at least 
300 times, at least 400 times, at least 500 times, at least 1000 times, at least 
5,000 times, or at least 10,000 times more product or full length product 
compared to a control reverse transcriptase (e.g., the corresponding un- 
mutated or un-modified reverse transcriptase) under the same reaction 
conditions and temperature. The reverse transcriptases of the invention 
preferably produce from 2 to 10,000, 5 to 10,000, 10 to 5,000, 50 to 5,000, 50 
to 500, 2 to 500, 5 to 500, 5 to 200, 5 to 100, or 5 to 75 times more product or 
full length product than a control reverse transcriptase under the same reaction 
conditions and temperature. 

In one aspect, the reverse transcriptases of the invention produce, at 
50°C, at least 25% more, preferably at least 50% more and more preferably at 
least 100% more nucleic acid product or full length product than a control 
reverse transcriptase (which is preferably the corresponding wild-type reverse 
transcriptase). In another aspect, at 52.5°C, the reverse transcriptases of the 
invention produce at least 1.5 times, at least 2 times, at least 2.5 times, at least 
3 times, at least 4 times, at least 5 times, at least 6 times, at least 7 times, at 
least 8 times, at least 9 times, at least 10 times the amount of nucleic acid 
product or full length product compared to a control reverse transcriptase. In 
another aspect, at 55°C, the reverse transcriptases of the invention produce at 
least 2 times, at least 5 times, at least 10 times, at least 15 times, at least 20 
times, at least 25 times, at least 50 times, at least 75 times, at least 100 times 
the amount of nucleic acid product or full length product compared to a 
control reverse transcriptase. Such comparisons are preferably made under the 
same reaction conditions and temperature. 

Modified or mutated reverse transcriptases of the present invention 
may have an increased thermostability at elevated temperatures as compared 
to corresponding un-modified or un-mutated reverse transcriptases. They may 
show increased thermostability in the presence or absence an RNA template. 
In some instances, reverse transcriptases of the invention may show an 
increased thermostability in both the presence and absence of an RNA 



template. Those skilled in the art will appreciate that reverse transcriptase 
enzymes are typically more thermostable in the presence of an RNA template. 
The increase in thermostability may be measured by comparing suitable 
parameters of the modified or mutated reverse transcriptase of the invention to 
those of a corresponding un-modified or un-mutated reverse transcriptase. 
Suitable parameters to compare include, but are not limited to, the amount of 
product and/or full length product synthesized by the modified or mutated 
reverse transcriptase at an elevated temperature compared to the amount or 
product and/or full length product synthesized by the corresponding un- 
modified or un-mutated reverse transcriptase at the same temperature, and/or 
the half-life of reverse transcriptase activity at an elevated temperature of a 
modified or mutated reverse transcriptase at an elevated temperature compared 
to that of a corresponding un-modified or un-mutated reverse transcriptase. 

[0039] A modified or mutated reverse transcriptase of the invention may have 

an increase in thermostability at 50°C of at least about 1.5 fold (e.g., from 
about 1.5 fold to about 100 fold, from about 1.5 fold to about 50 fold, from 
about 1.5 fold to about 25 fold, from about 1.5 fold to about 10 fold) 
compared, for example, to the corresponding un-mutated or un-modified 
reverse transcriptase. A reverse transcriptase of the invention may have an 
increase in thermostability at 50°C of at least about 10 fold (e.g., from about 
10 fold to about 100 fold, from about 10 fold to about 50 fold, from about 10 
fold to about 25 fold, or from about 10 fold to about 15 fold) compared, for 
example, to the corresponding un-mutated or un-modified reverse 
transcriptase. A reverse transcriptase of the invention may have an increase in 
thermostability at 50°C of at least about 25 fold (e.g., from about 25 fold to 
about 100 fold, from about 25 fold to about 75 fold, from about 25 fold to 
about 50 fold, or from about 25 fold to about 35 fold) compared to a 
corresponding un-mutated or un-modified reverse transcriptase. 

[0040] The present invention also contemplates a modified or mutated 

thermostable reverse transcriptase, wherein the reverse transcriptase has an 
increase in thermostability of greater than about 1.5 fold at 52.5°C (e.g., from 
about 1.5 fold to about 100 fold, from about 1.5 fold to about 50 fold, from 
about 1.5 fold to about 25 fold, or from about 1.5 fold to about 10 fold) 
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compared, for example, to the corresponding- un-mutated or un-modified 
reverse transcriptase. A reverse transcriptase of the invention may have an 
increase in thermostability at 52.5°C of at least about 10 fold (e.g., from about 
10 fold to about 100 fold, from about 10 fold to about 50 fold, from about 10 
fold to about 25 fold, or from about 10 fold to about 15 fold) compared, for 
example, to the corresponding un-mutated or un-modified reverse 
transcriptase. A reverse transcriptase of the invention may have an increase in 
thermostability at 52.5°C of at least about 25 fold (e.g., from about 25 fold to 
about 100 fold, from about 25 fold to about 75 fold, from about 25 fold to 
about 50 fold, or from about 25 fold to about 35 fold) compared, for example, 
to the corresponding un-mutated or un-modified reverse transcriptase. 

[0041] In other embodiments, the present Invention provides a reverse 

transcriptase, wherein the reverse transcriptase has an increase in 
thermostability of greater than about 1.5 fold at 55°C (e.g., from about 1.5 fold 
to about 100 fold, from about 1.5 fold to about 50 fold, from about 1.5 fold to 
about . 25 fold, or from about 1.5 fold to about 10 fold) compared to a 
corresponding un-mutated or un-modified reverse transcriptase. In some 
embodiments, a reverse transcriptase of the invention may have an increase in 
thermostability at 55°C of at least about 10 fold (e.g., from about 10 fold to 
about 100 fold, from about 10 fold to about 50 fold, from about 10 fold to 
about 25 fold, or from about 10 fold to about 15 fold) compared to a 
corresponding un-mutated or un-modified reverse transcriptase. In some 
embodiments, a reverse transcriptase of the invention may have an increase in 
thermostability at 55°C of at least about 25 fold (e.g., from about 25 fold to 
about 100 fold, from about 25 fold to about 75 fold, from about 25 fold to 
about 50 fold, or from about 25 fold to about 35 fold) compared to a 
corresponding un-mutated or un-modified reverse transcriptase. 

[0042] The present invention provides reverse transcriptase enzymes, 

compositions and kits comprising such enzymes, and methods useful in 
preparing labeled nucleic acid molecules by reverse transcription. In general, 
the invention relates to the use of polypeptides of the invention (e.g., reverse 
transcriptase enzymes having one or more of the mutations identified above) 
to synthesized labeled nucleic acid molecules. In some embodiments, 
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polypeptides of the invention may be heterodimers and more specifically two 
subunit enzymes (e.g., dimers) such as HIV RT and ASLV RTs. In some 
embodiments, polypeptides of the invention may be single sub-unit enzymes 
(e.g., M-MLV reverse transcriptase). Preferably, such labeling involves the 
use of modified nucleotides (e.g., labeled nucleotides, particularly 
fluorescently labeled nucleotides, nucleotide analogs and the like) and one or 
more nucleic acid templates (preferably RNA and most preferably mRNA). In 
accordance with the invention, one or more labeled nucleic acid molecules are 
synthesized which are complementary to all or a portion of the one or more 
templates. The labeled nucleic acid molecules preferably have one or more 
labeled nucleotides incorporated into the synthesized molecule and in a 
preferred aspect, the labels are one or more fluorescent labels (which may be 
the same or different). In another aspect, nucleotides are used during nucleic 
acid synthesis using the reverse transcriptases of the invention to produce one 
or more nucleic acid molecules complementary to all or a portion of one or 
more templates. In such aspects, such nucleotides, which are incorporated in 
the synthesized nucleic acid molecules, may be modified (before or after 
incorporation) to contain one or more labels , which may then be detected. 
[0043] The invention also relates to compositions for use in the invention and 

such compositions may comprise one or more polypeptides of the invention 
(e.g., single sub-unit such as M-MLV RT and/or multi-subunit RTs such as 
HIV and ASLV RTs). Such compositions may further comprise one or more 
nucleotides, a suitable buffer, and/or one or more DNA polymerases. The 
compositions of the invention may also comprise one or more primers. The 
reverse transcriptases in these compositions preferably have RNase H activity 
or are reduced or substantially reduced in RNase H activity, and most 
preferably are enzymes selected from the group consisting of Moloney Murine 
Leukemia Virus (M-MLV) reverse transcriptase, Rous Sarcoma Virus (RSV) 
reverse transcriptase, Avian Myeloblastosis Virus (AMV) reverse 
transcriptase, Rous Associated Virus (RAV) reverse transcriptase, 
Myeloblastosis Associated Virus (MAV) reverse transcriptase and Human 
Immunodeficiency Virus (HIV) reverse transcriptase or other ASLV reverse 
transcriptases. The reverse transcriptases of the invention may be composed 
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of one or more subunits (which may be the same or different). When two 
subunit RTs are use in the practice of the invention, such enzymes may 
contain various forms and combinations of such subunits such as a(i, aa, pp, 
etc. and mutants, variants or derivatives thereof. In preferred compositions, 
the reverse transcriptases are present at working concentrations. 

The invention is also directed to methods for making one or more 
nucleic acid molecules and/or labeled nucleic acid molecules, comprising 
mixing one or more nucleic acid templates (preferably one or more RNA 
templates and most preferably one or more messenger RNA templates) with 
one or more polypeptides of the invention having reverse transcriptase activity 
and incubating the mixture under conditions sufficient to synthesize one or 
more first nucleic acid molecules complementary to all or a portion of the one 
or more nucleic acid templates, wherein said at least one of said synthesized 
molecules are optionally labeled and/or comprise one or more labeled 
nucleotides and/or wherein said synthesized molecules may optionally be 
modified to contain one or more labels. In a preferred embodiment, the one or. 
more first nucleic acid molecules are single-stranded cDNA molecules. 
Nucleic acid templates suitable for reverse transcription according to this 
aspect of the invention include any nucleic acid molecule or population of 
nucleic acid molecules (preferably RNA and most preferably mRNA), 
particularly those derived from a cell or tissue. In a preferred aspect, a 
population of mRNA molecules (a number of different mRNA molecules, 
typically obtained from cells or tissue) are used to make a labeled cDNA 
library, in accordance with the invention. Preferred sources of nucleic acid 
templates include viruses, virally infected cells, bacterial cells, fungal cells, 
plant cells and animal cells. 

The invention also concerns methods for making one or more double- 
stranded nucleic acid molecules (which may optionally be labeled). Such 
methods comprise (a) mixing one or more nucleic acid templates (preferably 
RNA or mRNA, and more preferably a population of mRNA templates) with 
one or more polypeptides of the invention having reverse transcriptase 
activity; (b) incubating the mixture under conditions sufficient to make one or 
more first nucleic acid molecules complementary to all or a portion of the one 
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or more templates; and (c) incubating the one or more- first nucleic acid 
molecules under conditions sufficient to make one or more second nucleic 
acid molecules complementary to all or a portion of the one or more first 
nucleic acid molecules, thereby forming one or more double-stranded nucleic 
acid molecules comprising the first and second nucleic acid molecules. In 
accordance with the invention, the first and/or second nucleic acid molecules 
may be labeled {e.g., may comprise one or more of the same or different 
labeled nucleotides and/or may be modified to contain one or more of the 
same or different labels). Thus, labeled nucleotides may be used at one or 
both synthesis steps. Such methods may include the use of one or more DNA 
polymerases as part of the process of making the one or more double-stranded 
nucleic acid molecules. The invention also" concerns compositions useful for 
making such double-stranded nucleic acid molecules. Such compositions 
comprise one or more reverse transcriptases of the invention and optionally 
one or more DNA polymerases, a suitable buffer and/or one or more 
nucleotides (preferably including labeled nucleotides). 
[0046] The invention is also directed to nucleic acid molecules and/or labeled 

nucleic acid molecules (particularly single- or double-stranded cDNA 
molecules) produced according to the above-described methods and to kits 
comprising these nucleic acid molecules. Such molecules or kits may be used 
to detect nucleic acid molecules (for example by hybridization) or for 
diagnostic purposes. 

[0047] The invention is also directed to kits for use in the methods of the 

invention. Such kits can be used for making nucleic acid molecules and/or 
labeled nucleic acid molecules (single- or double-stranded). Kits of the 
invention may comprise a carrier, such as a box or carton, having in close 
confinement therein one or more containers, such as vials, tubes, bottles and 
the like. In kits of the invention, a first container may contain one or more of 
the reverse transcriptase enzymes of the invention or one or more of the 
compositions of the invention. Kits of the invention may also comprise, in the 
same or different containers, at least one component selected from one or 
more DNA polymerases (preferably thermostable DNA polymerases), a 
suitable buffer for nucleic acid synthesis and one or more nucleotides. 
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Alternatively, the components of the kit -may be divided into separate 
containers. In one aspect, kits of the invention comprise reverse transcriptases 
which have RNase H activity or are reduced or substantially reduced in RNase 
H activity (or lacking or having undetectable RNase H activity). Such RTs 
preferably are selected from the group consisting of M-MLV reverse 
transcriptase, RSV reverse transcriptase, AMV reverse transcriptase, RAV 
reverse transcriptase, MAV reverse transcriptase and HIV reverse 
transcriptase. In additional preferred kits of the invention, the enzymes (e.g. 
reverse transcriptases and/or DNA polymerases) in the containers are present 
at working concentrations. 

[0048] In specific embodiments, reverse transcriptases of the invention may 

not include M-MLV reverse transcriptases, HIV reverse transcriptases, AMV 
reverse transcriptases, and/or RSV reverse transcriptases. Thus, for example, 
in certain embodiments, the invention is directed to reverse transcriptases with 
increased thermostability that are not a HIV reverse transcriptase. In other 
embodiments, the invention is directed to reverse transcriptases with increased 
thermostability that are not a M-MLV reverse transcriptase. In yet other 
embodiments, the invention is directed to reverse transcriptases with increased 
thermostability that are not an AMV reverse transcriptase. In still other 
embodiments, the invention is directed to reverse transcriptases with increased 
thermostability that are not a RSV reverse transcriptase. 

[0049] The present invention is also directed to nucleic acid molecules (e.g., 

vectors) containing a gene or nucleic acid molecules encoding the mutant or 
modified reverse transcriptases of the present invention (or fragments thereof 
including fragments having polymerase activity) and to host cells containing 
such DNA or other nucleic acid molecules. Any number of hosts may be used 
to express the gene or nucleic acid molecule of interest, including prokaryotic 
and eukaryotic cells. In specific embodiments, prokaryotic cells are used to 
express the reverse transcriptases of the invention. One example of a 
prokaryotic host suitable for use with the present invention is Escherichia coli. 
Examples of eukaryotic hosts suitable for use with the present invention 
include fungal cells (e.g., Saccharomyces cerevisiae cells, Pichia pastoris 
cells, etc.), plant cells, and animal cells (e.g., Drosophila melanogaster cells, 
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Spodoptera frugiperda Sf9 and S£21 cells,. Trichoplusa High-Five cells, C. 
elegans cells, Xenopus laevis cells, CHO cells, COS cells, VERO cells, BHK 
cells, etc.). Preferably, polypeptides of the invention may be purified and/or 
isolated from a cell or organism expressing them, which may be a wild type 
cell or organism or a recombinant cell or organism. In some embodiments, 
such polypeptides may be substantially isolated from the cell or organism in 
which they are expressed. 
[0050] The invention also relates to a method of producing reverse 

transcriptases of the invention, said method comprising: 

(a) culturing a host cell comprising a gene or other nucleic acid 
molecule encoding a reverse transcriptase of the invention (preferably such 
reverse transcriptase gene or other nucleic acidv molecule is contained by a 
vector within the host cell); 

(b) expressing the gene or nucleic acid molecule; and 

(c) isolating or purifying said reverse transcriptase. 

[0051] The invention is also directed to methods for making one or more {e.g., 

one, two, three, four, five, ten, twelve, fifteen, etc.) nucleic acid molecules, 
comprising mixing one or more (e.g., one, two, three, four, five, ten, twelve, 
fifteen, etc.) nucleic acid templates (preferably one or more RNA templates 
and most preferably one or more messenger RNA templates or a population of 
messenger RNA templates) with one or more (e.g., one, two, three, four, five, 
ten, fifteen, etc.) reverse transcriptases of the invention and incubating the 
mixture under conditions sufficient to make a first nucleic acid molecule or 
molecules complementary to all or a portion of the one or more nucleic acid 
templates. In some embodiments, the mixture is incubated at an elevated 
temperature, i.e., greater than 37°C. In specific embodiments, the elevated 
temperature may be from about 40°C or greater, from about 45 °C or greater, 
from about 50°C or greater, from about 51°C or greater, from about 52°C or 
greater, from about 53 °C or greater, from about 54°C or greater, from about 
55°C or greater, from about 56°C or greater, from about 57°C or greater, from 
about 58°C or greater, from about 59°C or greater, from about 60°C or greater, 
from about 61°C or greater, from about 62°C or greater, from about 63°C or 
greater, from about 64°C or greater, from about 65 °C or greater, from about 
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66°C or greater, from about 67°C or greater, .from about 68 °C or greater, from 
about 69°C or greater, from about 70°C or greater, from about 71°C or greater, 
from about 72°C or greater, from about 73°C or greater, from about 74°C or 
greater, from about 75 °C or greater, from about 76°C or greater, from about 
77°C or greater, or from about 78 °C or greater. An elevated temperature may 
be within a temperature range of from about 40°C to about 45°C, from about 
40°C to about 48°C, from about 40°C to about 50°C, from about 40°C to 
about 52°C, from about 40°C to about 55°C, from about 40°C to about 58°C, 
from about 40°C to about 60°C, from about 40°C to about 65°C, from about 
42°C to about 45°C, from about 42°C to about 48°C, from about 42°C to 
about 50°C, from about 42°C to about 52°C, from about 42°C to about 55°C, 
from about 42°C to about 58°C, from about 42?€ to about 60°C, from about 
42°C to about 65°C, from about 45°C to about 48°C, from about 45°C to 
about 50°C, from about 45°C to about 52°C, from about 45°C to about 55°C, 
from about 45°C to about 58°C, from about 45°C to about 60°C, from about 
45°C to about 65°C, from about 48°C to about 50°C, from about 48°C to 
about 52°C, from about 48°C to about 55°C, from about 48°C to about 58°C, 
from about 48°C to about 60°C, from about 48°C to about 65°C, from about 
50°C to about 52°C, from about 50°C to about 55°C, from about 50°C to 
about 58°C, from about 50°C to about 60°C, from about 50°C to about 65°C, 
from about 52°C to about 55°C, from about 52°C to about 58°C, from about 
52°C to about 60°C, from about 52°C to about 65°C, from about 55°C to 
about 58°C, from about 55°C to about 60°C, from about 55°C to about 65°C, 
from about 55°C to about 70°C, from about 58°C to about 60°C, from about 
58°C to about 65°C, from about 58°C to about 70°C. An elevated temperature 
may be within a temperature range from about 37°C to about 75°C, from 
about 40°C to about 75°C, from about 45°C to about 75°C, from about 50°C 
to about 75°C, from about 51°C to about 75°C, from about 52°C to about 
75°C, from about 53°C to about 75°C, from about 54°C to about 75°C, from 
about 55°C to about 75°C. In other embodiments, the elevated temperature 
may be within the range of about 50°C to about 70°C, from about 51°C to 
about 70°C, from about 52°C to about 70°C, from about 53 °C to about 70°C, 
from about 54°C to about 70°C, from about 55°C to about 70°C, from about 
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56°C to about 65°C, from about 56°C to about 64°C or about 56°C to about 
62°C. In other embodiments, the elevated temperature may be within the 
range of about 46°C to about 60°C, from about 47°C to about 60°C, from 
about 49°C to about 60°C, from about 51°C to about 60°C, from about 53°C 
to about 60°C, or from about 54°C to about 60°C. In additional specific 
embodiments, the first nucleic acid molecule is a single-stranded cDNA. The 
invention further includes nucleic acid molecules prepared by the above 
methods and reaction mixtures used in and formed by such methods. Such 
conditions for incubation may include the use of one or more buffers or 
buffering salts, one or more primers (such as oligo dT primers) and/or one or 
more nucleotides (e.g.; one or more nucleoside triphosphates). The invention 
also concerns compositions for making one or more nucleic acid molecules 
comprising one or more components selected from the group consisting of one 
or more reverse transcriptases of the invention, one or more primers, one or 
more nucleotides and one or more suitable buffers. 
[0052] Nucleic acid templates suitable for reverse transcription according to 

this aspect of the invention include any nucleic acid molecule or population of 
nucleic acid molecules (preferably RNA and most preferably mRNA), 
particularly those derived from a cell or tissue. In a specific aspect, a 
population of mRNA molecules (a number of different mRNA molecules, 
typically obtained from a particular cell or tissue type) is used to make a 
cDNA library, in accordance with the invention. Examples of cellular sources 
of nucleic acid templates include bacterial cells, fungal cells, plant cells and 
animal cells. 

[0053] The invention also concerns methods for making one or more (e.g., 

one, two, three, four, five, ten, twelve, fifteen, etc.) double-stranded nucleic 
acid molecules. Such methods comprise (a) mixing one or more nucleic acid 
templates (preferably RNA or mRNA, and more preferably a population of 
mRNA templates) with one or more (e.g., one, two, three, four, five, ten, 
fifteen, etc.) reverse transcriptases of the invention; (b) incubating the mixture 
under conditions sufficient to make a first nucleic acid molecule or molecules 
complementary to all or a portion of the one or more templates; and (c) 
incubating the first nucleic acid molecule or molecules under conditions 
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sufficient to make a second nucleic acid - molecule or molecules 
complementary to all or a portion of the first nucleic acid molecule or 
molecules, thereby forming one or more double-stranded nucleic acid 
molecules comprising the first and second nucleic acid molecules. In some 
embodiments, the incubation of step (b) is performed at an elevated 
temperature. In some embodiments, conditions may comprise the use of one 
or more labeled nucleotides and the double stranded nucleic acid molecules 
may be labeled. In specific embodiments, the elevated temperature may be 
from about 40°C or greater, from about 45°C or greater, from about 50°C or 
greater, from about 51°C or greater, from about 52°C or greater, from about 
53°C or greater, from about 54°C or greater, from about 55°C or greater, from 
about 56°C or greater, from about 57°C or greater, from about 58°C or greater, 
from about 59°C or greater, from about 60°C or greater, from about 61°C or 
greater, from about 62°C or greater, from about 63°C or greater, from about 
64°C or greater, from about 65°C or greater, from about 66°C or greater, from 
about 67°C or greater, from about 68°C or greater, from about 69°C or greater, 
from about 70°C or greater, from about 71°C or greater, from about 72°C or 
greater, from about 73 °C or greater, from about 74°C or greater, from about 
75°C or greater, from about 76°C or greater, from about 77°C or greater, or 
from about 78°C or greater. An elevated temperature may be within a 
temperature range of from about 40°C to about 45°C, from about 40°C to 
about 48°C, from about 40°C to about 50°C, from about 40°C to about 52°C, 
from about 40°C to about 55°C, from about 40°C to about 58°C, from about 
40°C to about 60°C, from about 40°C to about 65 °C, from about 42°C to 
about 45°C, from about 42°C to about 48°C, from about 42°C to about 50°C, 
from about 42°C to about 52°C, from about 42°C to about 55°C, from about 
42°C to about 58°C, from about 42°C to about 60°C, from about 42°C to 
about 65°C, from about 45°C to about 48°C, from about 45°C to about 50°C, 
from about 45°C to about 52°C, from about 45°C to about 55°C, from about 
45°C to about 58°C, from about 45°C to about 60°C, from about 45°C to 
about 65°C, from about 48°C to about 50°C, from about 48°C to about 52°C, 
from about 48°C to about 55°C, from about 48°C to about 58°C, from about 
48°C to about 60°C, from about 48°C to about 65°C, from about 50°C to 
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about 52°C, from about 50°C to about 55°C, from about 50°C to about 58°C, 
from about 50°C to about 60°C, from about 50°C to about 65°C, from about 
52°C to about 55°C, from about 52°C to about 58°C, from about 52°C to 
about 60°C, from about 52°C to about 65°C, from about 55°C to about 58°C, 
from about 55°C to about 60°C, from about 55°C to about 65°C, from about 
55°C to about 70°C, from about 58°C to about 60°C 5 from about 58°C to 
about 65 °C, from about 58°C to about 70°C. An elevated temperature may be 
within a temperature range from about 37°C to about 75°C, from about 40°C 
to about 75°C, from about 45°C to about 75°C, from about 50°C to about 
75°C, from about 51°C to about 75°C, from about 52°C to about 75°C, from 
about 53°C to about 75°C, from about 54°C to about 75°C, from about 55°C 
to about 75°C In other embodiments, the elevated temperature may be within 
the range of about 50°C to about 70°C, from about 51°C to about 70°C, from 
about 52°C to about 70°C, from about 53°C to about 70°C, from about 54°C 
to about 70°C, from about 55°C to about 70°C, from about 56°C to about 
65°C, from about 56°C to about 64°C or about 56°C to about 62°C. In other 
embodiments, the elevated temperature may be within the range of about 46°C 
to about 60°C, from about 47°C to about 60°C, from about 49°C to about 
60°C, from about 51°C to about 60°C, from about 53°C to about 60°C, or 
from about 54°C to about 60°C Such conditions may involve the use of one 
or more suitable buffers or buffer salts, on or more primers (such as oligo dT 
primers), and one or more nucleotides. Such methods may include the use of 
one or more (e.g., one, two, three, four, five, ten, twelve, fifteen, etc.) DNA 
polymerases as part of the process of making the one or more double-stranded 
nucleic acid molecules. Such DNA polymerases are preferably thermostable 
DNA polymerases and most preferably the nucleic acid synthesis 
accomplished with such DNA polymerases is conducted at elevated 
temperatures, i.e., greater than 37°C. The invention also concerns 
compositions useful for making such double-stranded nucleic acid molecules. 
Such compositions comprise one or more (e.g., one, two, three, four, five, ten, 
twelve, fifteen, twenty, etc.) reverse transcriptases of the invention and 
optionally one or more DNA polymerases, a suitable buffer, one or more (e.g., 
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one, two, three, four, five, ten, twelve, fifteen, etc.) primers, and/or one or 
more (e.g., one, two, three, four, five, etc.) nucleotides. The invention further 
includes nucleic acid molecules prepared by the above methods and reaction 
mixtures used in and formed by such methods. 
[0054] The invention also relates to methods for amplifying a nucleic acid 

molecule. Such amplification methods comprise mixing the double-stranded 
nucleic acid molecule or molecules produced as described above with one or 
more (e.g., one, two, three, four, five, ten, twelve, fifteen, etc.) DNA 
polymerases (preferably thermostable DNA polymerases) and incubating the 
mixture under conditions sufficient to amplify the double-stranded nucleic 
acid molecule. In a first embodiment, the invention concerns a method for 
amplifying a nucleic acid molecule,vthe method comprising (a) mixing one or 
more (e.g., one, two, three, four, five, ten, twelve, fifteen, twenty, etc.) nucleic 
acid templates (preferably one or more RNA or mRNA templates and more 
preferably a population of mRNA templates) with one or more reverse 
transcriptases of the invention and with one or more DNA polymerases and (b) 
incubating the mixture under conditions sufficient to amplify nucleic acid 
molecules complementary to all or a portion of the one or more templates. In 
some embodiments, the incubation of step (b) is performed at an elevated 
temperature. In specific embodiments, the elevated temperature may be from 
about 40°C or greater, from about 45°C or greater, from about 50°C or greater, 
from about 51°C or greater, from about 52°C or greater, from about 53°C or 
greater, from about 54°C or greater, from about 55°C or greater, from about 
56°C or greater, from about 57°C or greater, from about 58°C or greater, from 
about 59°C or greater, from about 60°C or greater, from about 61°C or greater, 
from about 62°C or greater, from about 63 °C or greater, from about 64°C or 
greater, from about 65 °C or greater, from about 66°C or greater, from about 
67°C or greater, from about 68°C or greater, from about 69°C or greater, from 
about 70°C or greater, from about 71°C or greater, from about 72°C or greater, 
from about 73°C or greater, from about 74°C or greater, from about 75°C or 
greater, from about 76°C or greater, from about 77°C or greater, or from about 
78°C or greater. An elevated temperature may be within a temperature range 
of from about 40°C to about 45°C, from about 40°C to about 48°C, from about 
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40°C to about 50°C, from about 40?C to about 52°C, from about 40°C to 
about 55°C, from about 40°C to about 58°C, from about 40°C to about 60°C, 
from about 40°C to about 65°C, from about 42°C to about 45°C, from about 
42°C to about 48°C, from about 42°C to about 50°C, from about 42°C to 
about 52°C, from about 42°C to about 55°C, from about 42°C to about 58°C, 
from about 42°C to about 60°C, from about 42°C to about 65 °C, from about 
45°C to about 48°C, from about 45°C to about 50°C, from about 45°C to 
about 52°C, from about 45°C to about 55°C, from about 45°C to about 58°C, 
from about 45°C to about 60°C, from about 45°C to about 65°C, from about 
48°C to about 50°C, from about 48°C to about 52°C, from about 48°C to 
about 55°C, from about 48°C to about 58°C, from about 48°C to about 60°C, 
from about 48°C to about 65°C, from about 50°C to about 52°C, from about 
50°C to about 55°C, from about 50°C to about 58°C, from about 50°C to 
about 60°C, from about 50°C to about 65°C, from about 52°C to about 55°C, 
from about 52°C to about 58°C, from about 52°C to about 60°C, from about 
52°C to about 65°C, from about 55°C to about 58°C, from about 55°C to 
about 60°C, from about 55°C to about 65°C, from about 55°C to about 70°C, 
from about 58°C to about 60°C, from about 58°C to about 65°C, from about 
58°C to about 70°C. An elevated temperature may be within a temperature 
range from about 37°C to about 75°C, from about 40°C to about 75°C, from 
about 45°C to about 75°C, from about 50°C to about 75°C, from about 51°C 
to about 75°C, from about 52°C to about 75°C, from about 53°C to about 
75°C, from about 54°C to about 75°C, from about 55°C to about 75°C. In 
other embodiments, the elevated temperature may be within the range of about 
50°C to about 70°C, from about 51°C to about 70°C, from about 52°C to 
about 70°C, from about 53°C to about 70°C, from about 54°C to about 70°C, 
from about 55°C to about 70°C, from about 56°C to about 65°C, from about 
56°C to about 64°C or about 56°C to about 62°C. In other embodiments, the 
elevated temperature may be within the range of about 46°C to about 60°C, 
from about 47°C to about 60°C, from about 49°C to about 60°C, from about 
51°C to about 60°C, from about 53°C to about 60°C, or from about 54°C to 
about 60°C. 
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[0055] Preferably, reverse transcriptases of the .invention, used in methods of 

the invention, and/or present in compositions of the invention (1) are reduced 
or substantially reduced in RNase H activity, (2) are reduced or substantially 
reduced in TdT activity, and/or (3) exhibit increased fidelity. Preferably, 
DNA polymerases used with the invention may comprise a first DNA 
polymerase having 3' exonuclease activity and a second DNA polymerase 
having substantially reduced 3' exonuclease activity. The invention further 
includes nucleic acid molecules prepared by the above methods and reaction 
mixtures used in and formed by such methods. 

[0056] The invention also concerns compositions comprising one or more 

reverse transcriptases of the invention and one or more DNA polymerases for 
use in amplification reactions. Such compositions may further comprise one 
or more nucleotides and/or a buffer suitable for amplification. Compositions 
of the invention may also comprise one or more oligonucleotide primers. 
Compositions of the invention may further include nucleic acid molecules 
prepared by the above methods and reaction mixtures used in and formed by 
such methods. 

[0057] The invention is also directed to nucleic acid molecules (particularly 

single- or double-stranded cDNA molecules) or amplified nucleic acid 
molecules produced according to the above-described methods and to vectors 
(particularly expression vectors) comprising these nucleic acid molecules or 
amplified nucleic acid molecules. 

[0058] The invention is further directed to recombinant host cells comprising 

the above-described nucleic acid molecules, amplified nucleic acid molecules 
or vectors. Examples of such host cells include bacterial cells, yeast cells, 
plant cells and animal cells (including insect cells and mammalian cells). 

[0059] The invention is additionally directed to methods of producing 

polypeptides encoded by the nucleic acid molecules produced by the methods 
of the invention. Such methods include those comprising culturing the above- 
described recombinant host cells and isolating the encoded polypeptides. The 
invention further includes polypeptides produced by such methods. 

[0060] The invention also concerns methods for sequencing one or more (e.g., 

one, two, three, four, five, ten, twelve, fifteen, etc.) nucleic acid molecules 
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using compositions or enzymes of the invention. Such methods comprise (a) 
mixing one or more nucleic acid molecules (e.g., one or more RNA or DNA 
molecules) to be sequenced with one or more reverse transcriptases of the 
invention, and, optionally, one or more nucleotides, one or more terminating 
agents, such as one or more dideoxynucleoside triphosphates, and one or more 
primers; (b) incubating the mixture under conditions sufficient to synthesize a 
population of nucleic acid molecules complementary to all or a portion of the 
one or more (e.g., one, two, three, four, five, ten, twelve, fifteen, twenty, 
thirty, fifty, one hundred, two hundred, etc.) nucleic acid molecules to be 
sequenced; and (c) separating the population of nucleic acid molecules to 
determine the nucleotide sequence of all or a portion of the one or more 
nucleic acid molecules to be sequenced. Such methods may also comprise (a) 
mixing a nucleic acid molecule (e.g., one or more RNA or DNA molecules) to 
be sequenced with one or more primers, one or more reverse transcriptases of 
the invention, one or more nucleotides and one or more terminating agents, 
such as one or more dideoxynucleoside triphosphates; (b) incubating the 
mixture under conditions sufficient to synthesize a population of nucleic acid 
molecules complementary to all or a portion of the nucleic acid molecule to be 
sequenced; and (c) separating members of the population of nucleic acid 
molecules to determine the nucleotide sequence of all or a portion of the 
nucleic acid molecule to be sequenced. In some embodiments, such 
incubation may be performed at elevated temperatures as described herein. 
The invention further includes sequence data generated by the above methods, 
as well as methods for generating such sequence data, and reaction mixtures 
used in and formed by such methods. 

The invention is also directed to kits for use in methods of the 
invention. Such kits can be used for making, sequencing or amplifying 
nucleic acid molecules (single- or double-stranded), preferably at the elevated 
temperatures described herein. Kits of the invention may comprise a carrier, 
such as a box or carton, having in close confinement therein one or more (e.g., 
one, two, three, four, five, ten, twelve, fifteen, etc.) containers, such as vials, 
tubes, bottles and the like. In kits of the invention, a first container contains 
one or more of the reverse transcriptase enzymes of the present invention. 
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Kits of the invention may also comprise, in the same or different containers, 
one or more DNA polymerases (preferably thermostable DNA polymerases), 
one or more (e.g., one, two, three, four, five, ten, twelve, fifteen, etc.) suitable 
buffers for nucleic acid synthesis, one or more nucleotides and one or more 
(e.g., one, two, three, four, five, ten, twelve, fifteen, etc.) oligonucleotide 
primers. Alternatively, the components of the kit may be divided into separate 
containers (e.g., one container for each enzyme and/or component). Kits of 
the invention also may comprise instructions or protocols for carrying out the 
methods of the invention. In preferred kits of the invention, the reverse 
transcriptases are reduced or substantially reduced in RNase H activity (or 
lacking or having undetectable RNase H activity), and are most preferably 
selected from the group consisting of MJMLV RNase H- reverse transcriptase, 
RSV RNase H- reverse transcriptase, AMV RNase H- reverse transcriptase, 
RAV RNase H- reverse transcriptase, MAV RNase H- reverse transcriptase 
and HIV RNase H- reverse transcriptase. In other preferred kits of the 
invention, the reverse transcriptases are reduced or substantially reduced in 
TdT activity, and/or exhibit increased fidelity, as described elsewhere herein. 

[0062] In additional preferred kits of the invention, the enzymes (reverse 

transcriptases and/or DNA polymerases) in the containers are present at 
working concentrations. 

[0063] Thus, the invention is further directed to kits for use in reverse 

transcription, amplification or sequencing of a nucleic acid molecule, the kit 
comprising one or more reverse transcriptases of the invention. 

[0064} As indicated above, kits of the invention may contain any number of 

various components for practicing methods of the invention. One example of 
such a component is instructions for performing methods of the invention. 
Example of such instructions include those which direct individuals using the 
kits to perform methods for amplifying nucleic acid molecules using one or 
more reverse transcriptases of the invention. 

[0065] As one skilled in the art would recognize, the full text of these 

instructions need not be included with the kit. One example of a situation in 
which kits of the invention would not contain such full length instructions is 
where directions are provided which inform individuals using the kits where to 
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obtain instructions for using the kit. Thus, instructions for performing 
methods of the invention may be obtain from internet web pages, separately 
sold or distributed manuals or other product literature, etc. The invention thus 
includes kits which direct kit users to locations where they can find 
instructions which are not directly packaged and/or distributed with the kits. 
These instructions may be in any form including, but not limited to, electronic 
or printed forms. 

| The invention thus also provides, in part, kits for performing methods 

using the reverse transcriptases of the invention. In specific embodiments, 
kits of the invention contain instructions for performing methods for 
amplifying and/or sequencing nucleic acid molecules. These methods will 
often involve reacting RNA molecules with one or more reverse transcriptases 
of the invention. 

In specific embodiments, reverse transcriptases of kits of the invention 
may comprise one or more modifications or mutations at positions 
corresponding to amino acids selected from the group consisting of: 

(a) leucine 52 of M-MLV reverse transcriptase; 

(b) tyrosine 64 of M-MLV reverse transcriptase; 

(c) lysine 152 of M-MLV reverse transcriptase; 

(d) arginine 204 of M-MLV reverse transcriptase; 

(e) methionine 289 of M-MLV reverse transcriptase; 

(f) threonine 306 of M-MLV reverse transcriptase; and 

(g) phenylalanine 309 of M-MLV reverse transcriptase. 

Reverse transcriptases of the invention include any reverse 
transcriptase having one or a combination of the properties described herein. 
Such properties include, but are not limited to, enhanced thermostability, 
reduced or eliminated RNase H activity, reduced terminal deoxynucleotidyl 
transferase activity, and/or increased fidelity. Such reverse transcriptases 
include retroviral reverse transcriptases, bacterial reverse transcriptases, 
retro transpo son reverse transcriptases (e.g., reverse transcriptases of the Tyl 
and/or Ty3 retrotransposons), and DNA polymerases having reverse 
transcriptase activity. Preferred reverse transcriptases of the invention include 
a single and multi-subunit reverse transcriptase and preferably retroviral 
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reverse transcriptases. In particular, the invention relates to M-MLV -reverse 
transcriptases and ASLV-reverse transcriptases (such as AMV-RT and RSV- 
RT). Such reverse transcriptases of the invention preferably have reduced, 
substantially reduced, or no detectable RNase H activity. 
[0069] Other embodiments of the present invention will be apparent to one of 

ordinary skill in light of the following drawings and description of the 
invention, and of the claims. 

BRIEF DESCRIPTION OF THE DRAWINGS/FIGURES 

[0070] Fig. 1 is a map of plasmid pBAD-6-His-M-MLV H- (Fl). 

[0071] Fig. 2A is a linear representation of the coding sequence of the M- 

MLV reverse transcriptase showing the locations of the restriction enzyme 
cleavage sites used to generate the segments of the gene used to generate 
mutations. Fig. 2B is a schematic representation showing the insertion of a 
mutagenized PCR fragment into the coding sequence of the remaining portion 
of the reverse transcriptase gene. 

[0072] Fig. 3 represents a scanned phosphoimage of an extension assay using 

(1) Superscript™ II reverse transcriptase, and (2) F309N. The [ 32 P]-labeled 
1 8-mer primer annealed to a 47-mer DNA template (5 nM) was extended by 
equal units of reverse transcriptase at 37°C for 30 minutes as seen in the 
extension reactions with all 4 nucleotides. The extension reactions were 
analyzed by denaturing 6% gel electrophoresis. P, non-extended primer. 

[0073] Fig. 4 represents a scanned phosphoimage showing a TdT extension 

assay of Superscript™ II reverse transcriptase and the mutants F309N, 
T197E, and Y133A. The [ 32 P]-labeled 1 8-mer primer annealed to a 47-mer 
DNA template (5 nM) was extended with decreasing units of reverse 
transcriptase (lane (1) 646 units, lane (2) 200 units, lane (3) 50 units, and lane 
(4) 20 units) at 37°C for 30 minutes with all four nucleotides (see the Methods 
section below in Example 3). The extension reactions were analyzed by 
denaturing 6% gel electrophoresis. In this assay, extension past the 47 
nucleotide templates is considered non-template directed addition or TdT 
activity. P, non-extended primer. 



-37- 

[0074] Fig. 5 represents a scanned phosphoimage showing misinsertion assays 

of Superscript™ II reverse transcriptase (1) and mutant protein F309N 
reverse transcriptase (2) with DNA template. The [ 32 P]-labeled 18-mer primer 
annealed to a 47-mer DNA template (5 nM) was extended by equal units of 
reverse transcriptase protein at 37°C for 30 min. as seen in the extension 
reactions with all four nucleotides. The extension reactions were also 
performed in the presence of only 3 complementary dNTPs; minus dCTP, 
minus dATP, minus TTP, and minus dGTP. The extension reactions were 
analyzed by denaturing 6% gel electrophoresis. In this assay, the higher 
efficiency of elongation of terminated primer with only three nucleotides will 
reflect the lower fidelity of the Superscript™ II reverse transcriptase 
assayed. P, non-extended primer. 
[0075] Fig. 6 represents a scanned phosphoimage showing a misinsertion 

assay of Superscript™ II reverse transcriptase (1) and mutant protein 
T197A/F309N reverse transcriptase (2) and V223H/F309N (3) with DNA 
template. The [ 32 P]-labeled 18-mer primer annealed to a 47-mer DNA 
template (5 nM) was extended by equal units of reverse transcriptase protein at 
37°C for 30 min. as seen in the extension reactions with all four nucleotides. 
The extension reactions were also performed in the presence of only 3 
complementary dNTPs; minus dATP, and minus dCTP. The extension 
reactions were analyzed by denaturing 6% gel electrophoresis. In this assay, 
the higher efficiency of elongation of terminated primer with only three 
nucleotides will reflect the lower fidelity of the Superscript™ II reverse 
transcriptase assayed. P, non-extended primer. 
[0076] Figs. 7A-7C show representative results obtained from the screen for 

thermal stable RT mutants. Lysates of mutants were assayed for RT activity 
in a 96-well plate format. 32 P-Labeled DNA product was trapped on a 
membrane and the amount of radioactivity present was quantified with a 
phosphorimager. Fig. 7A shows the results of an initial screen of RT mutants 
in 4, 96-well plates. Heat pretreatment of lysates was at 58°C for 10 min. RT 
mutants that retained the most activity after heat treatment at 58°C were 
selected and lysates were screened again and the results are shown in 7B. A 
duplicate screen was performed with no heat pretreatment (Fig. 7B upper 
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panel) and heat pretreatment at 58°C (Fig.~7B lower panel). RT mutants with 
the highest resistance to heat inactivation in crude extracts were purified by 
nickel-affinity chromatography and screened again for RT activity and the 
results are shown in Fig. 7C. The results after heat treatment at 37°C are 
shown in Fig. 7C in the upper row, after heat treatment at 53°C in Fig. 7C 
middle row, and after heat treatment at 58°C in 7C bottom row. 

[0077] Fig. 8 shows a comparison of the thermal inactivation profiles of His 6 

H- RT and His 6 H- H204R T306K RT in crude extracts. Crude extracts were 
subjected to a heat treatment in a 96-well plate for 5 min. The temperature of 
the heat treatment increased from left to right, except that the wells on the far 
right were not heat treated. 

[0078] Fig, 9 is a ribbon diagram of the crystal structure of amino acids 193 to 

232 of M-MLV RT showing the sites of some of the amino acids identified by 
the methods of the present invention. Potential interactions of arginine 
substituted for histidine at M-MLV RT position 204 in a-helix H with E201 or 
T128. The catalytic site amino acids D224 and D225 in the turn between piO 
and pil are also shown. The three-dimensional structure is taken from 
Georgiadis, et al., (1995) Structure 3, 879-892, Thus, the invention also 
includes reverse transcriptases having one or more mutations or modifications 
in various regions including the a-helix H region. 

[0079] Figures 10A and 10B are graphs of reverse transcriptase activity as a 

function of Mg 2+ concentration (Fig. 10A) and KC1 concentration (Fig. 10B). 
The DNA polymerase assay for SUPERSCRIPT™ III (SuIII) RT was conducted 
at 37°C or 50°C for 10 minutes under various concentrations of A) Mg 2+ or B) 
KC1. Superscript™ II (SuII) at 37°C included for comparison). 

[0080] Figures 1 1 A and 1 IB show autoradiograms of TdT activity measure by 

extension for 60 minutes at various temperatures of a labeled DNA primer on 
DNA (Fig. 11 A) or RNA (Fig. 11B) template forming a blunt end. T is 
template only, Lanes marked (-) is T-P plus enzyme without dNTPs. Since 
Superscript™ III is more thermostable, its TdT activity appears greater at 50 
degrees than SUPERSCRIPT™ II. 
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[00811 Figures 12A, 12B and 12C are graphs of RT activity as a function of 

incubation time. Fig. 12A shows the data obtained at 50°C, Fig. 12B shows 
the data obtained at 55°C, and Fig. 12C shows the data obtained at 60°C. 

[0082] Figure 13 is an autoradiogram comparing reverse transcriptase activity 

of a variety of commercially available reverse transcriptase enzymes at 45°C, 
50°C, and 55°C. Superscript™ III is designated SS III and Superscript™ 
II is designated SS II. 

[0083] Figure 14 is a photograph of ethidium bromide stained gels showing 

the results of the evaluation of the pH of the first strand buffer. RT reactions 
with first-strand buffers at pHs from 8.0 to 8.8 were performed with 500 ng of 
total Hela RNA and 200 units of SUPERSCRIPT™ II (SS II) or 400 units of 
Superscript™ HI (LEFN RT, which contains an N-terminal tag sequence = 
MASGTGGQQMGRDLYDDDDKH (SEQ ID NO:3) and the following point 
mutations H204R, T306K, M289L, and F309N). 2jil of the resulting cDNA 
were then added to 50 PCR reactions containing the BF 2.4 kb or Pol 8 6.8 
kb primer set. Resulting PCR products were then run on a 0.8% agarose gel 
containing 0.4 mg/ml ethidium bromide. 

[0084] Figure 15 is a photograph of ethidium bromide stained gels showing 

the results of the evaluation of the effect of temperature on the reverse 
transcription reaction with various reverse transcriptases. SUPERSCRIPT™ II 
(SS II) was compared to the His-tagged EFN reverse transcriptase (His tag 
sequence = MGGSHHHHHHGMASMTGGQQMGRDLYDDDDKH, amino 
acids 1-32 of SEQ ID NO:2 and Table 3, EFN mutations are H204R, T306K, 
and F309N), His-tagged LEFN reverse transcriptase (same His tag sequence, 
LEFN mutations are H204R, T306K, M289L, and F309N), and to 
SUPERSCRIPT™ III, which is the tagged, no His LEFN reverse transcriptase 
(tag sequence - MASGTGGQQMGRDLYDDDDKH (SEQ ID NO:3), LEFN 
mutations are H204R, T306K, M289L, and F309N). 

(0085] Figure 16 is a photograph of ethidium bromide stained gels showing 

the results of the evaluation of the effect of reverse transcriptase concentration 
on the reverse transcription reaction with SUPERSCRIPT™ III designated LEFN 
which contains the tag sequence = MASGTGGQQMGRDLYDDDDKH (SEQ 
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ID NO:l), and LEFN mutations, which are H204R, T306K, M289L, and 
F309N. 

[0086) Figure 17 is a photograph of ethidium bromide stained gels showing 

the results of the comparison of hot start RT-PCR amplification by 
Superscript™ II (Panel A) or Superscript™ III (Panel B). Lanes (in 
duplicate) 1, 4, 7, and 10 are products reverse transcribed at 42 °C. Lanes 2, 5, 
8, and 1 1 are products reverse transcribed at 50 °C. Lanes 3, 6, 9, and 12 are 
products transcribed at 55 °C. Lanes 1-3 are the result of RNAs reverse 
transcribed by gene-specific priming from FGF, lanes 4-6 CBS 2.4, lanes 7-9 
from TOP 3.2, lanes 10-12 VIN 4.6. Arrows indicate expected product sizes 
of 240 bp, 2390 bp, 3162 bp, and 4641bp. Superscript™ III contains the tag 
sequence = MASGTGGQQMGRDLYDDDDKH (SEQ ID NO:3), and the 
LEFN mutations, which are H204R, T306K, M289L, and F309N. 

[0087] Figure 18 shows the results of RT-PCR performed with varying 

amounts Superscript™ III from 25 units to 250 units per reaction with a 
variety of primer sets. 

[0088] Figure 19 shows a comparison of Superscript™ II (SS II) and His 

tagged LEFN RT in RT-PCR using 200 or 400 units in the first strand 
reaction. His tagged LEFN has the His tag sequence = 
MGGSHHHHHHGMASMTGGQQMGRDLYDDDDKH, amino acids 1-32 
of SEQ ID NO:2 and Table 3, and the LEFN mutations, which are H204R, 
T306K, M289L, and F309N). 

[0089] Figure 20 shows the use of Superscript™ III (LEFN RT) in RT-PCR 

with varying amounts of RT in the first strand reaction. SUPERSCRIPT™ III 
contains the tag sequence = MASGTGGQQMGRDLYDDDDKH (SEQ ID 
NO:3), and the LEFN mutations, which are H204R, T306K, M289L, and 
F309N. 

[0090] Figure 21 shows the results of a comparison of various primers in RT- 

PCR reactions using the polypeptides of the invention. EFN contains the tag 
sequence = MASGTGGQQMGRDLYDDDDKH (SEQ ID NO:3), and the 
EFN mutations, which are H204R, T306K, and F309N. 
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DETAILED DESCRIPTION OF THE INVENTION 



[0091] In the description that follows, a number of terms used in recombinant 

DNA, virology and immunology are utilized. In order to provide a clearer and 
consistent understanding of the specification and claims, including the scope 
to be given such terms, the following definitions are provided. 

[0092] Cloning vector. As used herein "cloning vector" means a nucleic acid 

molecule such as plasmid, cosmid, phage, phagemid or other nucleic acid 
molecule which is able to replicate autonomously in a host cell, and which is 
characterized by one or a small number of recognition sequences, {e.g., 
restriction endonuclease recognition sites, recombination sites, topoisomerase 
recognition sites,^ etc.) . at which such nucleic acid sequences may be 
manipulated in a determinable fashion, and into which a nucleic acid segment 
of interest may be inserted in order to bring about its replication and cloning. 
The cloning vector may further contain a marker suitable for use in the 
identification of cells transformed with the cloning vector. Markers, for 
example, are genes that confer a recognizable phenotype on host cells in 
which such markers are expressed. Commonly used markers include, but are 
not limited to, antibiotic resistance genes such as tetracycline resistance or 
ampicillin resistance. 

[0093] Expression vector. As used herein "expression vector" means a 

nucleic acid molecule similar to a cloning vector but which may additionally 
comprise nucleic acid sequences capable of enhancing and/or controlling the 
expression of a gene or other nucleic acid molecule which has been cloned 
into it, after transformation into a host. The additional nucleic acid sequences 
may comprise promoter sequences, repressor binding sequences and the like. 
The cloned gene or nucleic acid molecule is usually operably linked to one or 
more {e.g., one, two, three, four, etc.) of such control sequences such as 
promoter sequences. 

[0094] Recombinant host. As used herein "recombinant" means any 

prokaryotic or eukaryotic or microorganism which contains the desired cloned 
genes or nucleic acid molecules, for example, in an expression vector, cloning 
vector or any nucleic acid molecule. The term "recombinant host" is also 
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meant to include those host cells which have been genetically engineered to 
contain the desired gene or other nucleic acid molecule on the host 
chromosome or genome. 
[0095] Host. As used herein "host" means any prokaryotic or eukaryotic cell 

or organism that is the recipient of a replicable expression vector, cloning 
vector or any nucleic acid molecule. The nucleic acid molecule may contain, 
but is not limited to, a structural gene, a promoter and/or an origin of 
replication. 

[0096] Promoter. As used herein "promoter" means a nucleic acid sequence 

generally described as the 5' region of a gene, located proximal to the start 
codon which is capable of directing the transcription of a gene or other nucleic 
acid molecule. At the promoter region, transcription of an adjacent gene(s) or 
nucleic acid(s) is initiated. 

[0097] Gene. As used herein "gene" means a nucleic acid sequence that 

contains information necessary for expression of a polypeptide or protein. It 
includes the promoter and the structural gene as well as other sequences 
involved in expression of the protein. 

[0098] Structural gene. As used herein "structural gene" means a DNA or 

other nucleic acid sequence that is transcribed into messenger RNA that is 
then translated into a sequence of amino acids characteristic of a specific 
polypeptide. 

[0099] Operably linked. As used herein "operably linked" means that a 

nucleic acid element is positioned so as to influence the initiation of 
expression of the polypeptide encoded by the structural gene or other nucleic 
acid molecule. 

[0100] Expression. As used herein "expression" refers to the process by 

which a gene or other nucleic acid molecule produces a polypeptide. It 
includes transcription of the gene or nucleic acid molecule into messenger 
RNA (mRNA) and the translation of such mRNA into polypeptide(s). 

[0101] Substantially Pure. As used herein "substantially pure" means that the 

desired material is essentially free from contaminating cellular components 
which are associated with the desired material in nature. In a preferred aspect, 
a reverse transcriptase of the invention has 25% or less, preferably 15% or 
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less, more preferably 10% or less, more preferably 5% or less, and still more 
preferably 1% or less contaminating cellular components. In another aspect, 
the reverse transcriptases of the invention have no detectable protein 
contaminants when 200 units of reverse transcriptase are run on a protein gel 
{e.g., SDS-PAGE) and stained with Comassie blue. Contaminating cellular 
components may include, but are not limited to, enzymatic activities such as 
phosphatases, exonucleases, endonucleases or undesirable DNA polymerase 
enzymes. Preferably, reverse transcriptases of the invention are substantially 
pure. 

[0102] Substantially isolated. As used herein "substantially isolated" means 

that the polypeptide of the invention is essentially free from contaminating 
proteins, which may be associated with the polypeptide of the invention in 
nature and/or in a recombinant host. In one aspect, a substantially isolated 
reverse transcriptase of the invention has 25% or less, preferably 15% or less, 
more preferably 10% or less, more preferably 5% or less, and still more 
preferably 1% or less contaminating proteins. In another aspect, in a sample 
of a substantially isolated polypeptide of the invention, 75% or greater 
(preferably 80%, 85%, 90%, 95%, 98%, or 99% or greater) of the protein in 
the sample is the desired reverse transcriptase of the invention. The 
percentage of contaminating protein and/or protein of interest in a sample may 
be determined using techniques known in the art, for example, by using a 
protein gel {e.g., SDS-PAGE) and staining the gel with a protein dye {e.g., 
Coomassie blue, silver stain, amido black, etc.). In another aspect, the reverse 
transcriptases of the invention have no detectable protein contaminants when 
200 units of reverse transcriptase are run on a protein gel {e.g., SDS-PAGE) 
and stained with Comassie blue. 

[0103] Primer. As used herein "primer" refers to a single-stranded 

oligonucleotide that is extended by covalent bonding of nucleotide monomers 
during amplification or polymerization of a DNA molecule. 

[0104] Template. The term "template" as used herein refers to a double- 

stranded or single-stranded nucleic acid molecule which is to be amplified, 
copied or sequenced. In the case of a double-stranded DNA molecule, 
denaturation of its strands to form single-stranded first and second strands may 
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be performed before these molecules are amplified, copied or sequenced. A 
primer complementary to a portion of a nucleic acid template is hybridized 
under appropriate conditions and a nucleic acid polymerase, such as the 
reverse transcriptase enzymes of the invention, may then add nucleotide 
monomers to the primer thereby synthesizing a nucleic acid molecule 
complementary to said template or a portion thereof. The newly synthesized 
nucleic acid molecule, according to the invention, may be equal or shorter in 
length than the original template. Mismatch incorporation during the 
synthesis or extension of the newly synthesized nucleic acid molecule may 
result in one or a number of mismatched base pairs. Thus, the synthesized 
nucleic acid molecule need not be exactly complementary to the template. 

[0105] Incorporating. The ^ term " incorporating" as used herein means 

becoming a part of a nucleic acid molecule or primer. 

[0106] Oligonucleotide. "Oligonucleotide" refers to a synthetic or natural 

molecule comprising a covalently linked sequence of nucleotides which are 
joined by a phosphodiester bond between the 3' position of the pentose of one 
nucleotide and the 5' position of the pentose of the adjacent nucleotide. 

[0107] Nucleotide. As used herein "nucleotide" refers to a 

base-sugar-phosphate combination. Nucleotides are monomelic units of a 
nucleic acid sequence (DNA and RNA). The term nucleotide includes 
ribonucleoside triphosphates ATP, UTP, CTG, GTP and deoxyribonucleoside 
triphosphates such as dATP, dCTP, dITP, dUTP, dGTP, dTTP, or derivatives 
thereof. Such derivatives include, for example, [aSJdATP, 7-deaza-dGTP and 
7-deaza-dATP, and nucleotide derivatives that confer nuclease resistance on 
the nucleic acid molecule containing them. The term nucleotide as used 
herein also refers to dideoxyribonucleoside triphosphates (ddNTPs) and their 
derivatives. Illustrated examples of dideoxyribonucleoside triphosphates 
include, but are not limited to, ddATP, ddCTP, ddGTP, ddlTP, and ddTTP. 
According to the present invention, a "nucleotide" may be unlabeled or 
detectably labeled by well known techniques. Detectable labels include, for 
example, radioactive isotopes, fluorescent labels, chemiluminescent labels, 
bioluminescent labels and enzyme labels. Fluorescent labels of nucleotides 
may include but are not limited fluorescein, 5-carboxyfluorescein (FAM), 
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2 ' 7 ' -dimethoxy-4 5 5 -dichloro-6-carboxy fluorescein (JOE), rhodamine, 6- 
carboxyrhodamine (R6G), N,N,N' 5 N'-tetramethyl-6-carboxyrhodamine 
(TAMRA), 6-carboxy-X-rhodamine (ROX), 4-(4'dimethylaminophenylazo) 
benzoic acid (DABCYL), Cascade Blue, Oregon Green, Texas Red, Cyanine 
and 5-(2 , -aminoethyl)aminonaphthalene-l-sulfonic acid (EDANS). Specific 
examples of fluroescently labeled nucleotides include [R6G]dUTP, 
[TAMRA]dUTP, [R110]dCTP, [R6G]dCTP, [TAMRA]dCTP, [JOEJddATP, 
[R6G]ddATP, [FAMJddCTP, [Rl 10]ddCTP, [TAMRA]ddGTP, 
[ROX]ddTTP, [dR6G]ddATP, [dR110]ddCTP, [dTAMRA]ddGTP, and 
[dROXJddTTP available from Perkin Elmer, Foster City, CA. FluoroLink 
DeoxyNucleotides, FluoroLink Cy3-dCTP, FluoroLink Cy5-dCTP, 
FluoroLink FluorX-dCTP, FluoroLink Cy3-dUTP, and FluoroLink Cy5-dUTP 
available from Amersham Arlington Heights, IL; Fluorescein- 1 5 -d ATP, 
Fluorescein- 1 2-dUTP, Tetramethyl-rodamine-6-dUTP, IR770- 9-dATP, 
Fluorescein- 12-ddUTP, Fluorescein- 12-UTP, and Fluorescein-15-2'-dATP 
available from Boehringer Mannheim Indianapolis, IN; and ChromaTide 
Labeled Nucleotides, BODIPY-FL-14-UTP, BODEPY-FL-4-UTP, BODIPY- 
TMR-14-UTP, BODIPY-TMR-14-dUTP, BODIPY-TR-14-UTP, BODIPY- 
TR-14-dUTP, Cascade Blue-7-UTP, Cascade Blue-7-dUTP, fluorescein- 12- 
UTP, fluorescein-12-dUTP, Oregon Green 488-5-dUTP, Rhodamine Green-5- 
UTP, Rhodamine Green-5-dUTP, tetramethylrhodamine-6-UTP, 
tetramethylrhodamine-6-dUTP, Texas Red-5-UTP, Texas Red-5-dUTP, and 
Texas Red-12-dUTP available from Molecular Probes, Eugene, OR. 
[0108] Probes. The term probes refer to single or double stranded nucleic acid 

molecules or oligonucleotides which are detectably labeled by one or more 
detectable markers or labels. Such labels or markers may be the same or 
different and may include radioactive labels, fluorescent labels, 
chemiluminescent labels, bioluminescent labels and enzyme labels, although 
one or more fluorescent labels (which are the same or different) are preferred 
in accordance with the invention. Probes have specific utility in the detection 
of nucleic acid molecules by hybridization and thus may be used in diagnostic 
assays. 
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Hybridization. . As used herein, hybridization (hybridizing) refers to 
the pairing of two complementary single-stranded nucleic acid molecules 
(RNA and/or DNA) to give a double-stranded molecule. As one skilled in the 
art will recognize, two nucleic acid molecules may be hybridized, although the 
base pairing is not completely complementary. Accordingly, mismatched 
bases do not prevent hybridization of two nucleic acid molecules provided that 
appropriate conditions, well known in the art, are used. 

Thermostable Reverse Transcriptase. For the purposes of this 
disclosure, a thermostable reverse transcriptase includes a reverse transcriptase 
which retains a greater percentage or amount of its activity after a heat 
treatment than is retained by a reverse transcriptase that has wild-type 
thermostability after an identical treatment. Thus, a reverse transcriptase 
having increased/enhanced thermostability may be defined as a reverse 
transcriptase having any increase in thermostability, preferably from about 1 .2 
to about 10,000 fold, from about L5 to about 10,000 fold, from about 2 to 
about 5,000 fold, or from about 2 to about 2000 fold (preferably greater than 
about 5 fold, more preferably greater than about 10 fold, still more preferably 
greater than about 50 fold, still more preferably greater than about 100 fold, 
still more preferably greater than about 500 fold, and most preferably greater 
than about 1000 fold) retention of activity after a heat treatment sufficient to 
cause a reduction in the activity of a reverse transcriptase that is wild-type for 
thermostability. Preferably, the mutant or modified reverse transcriptase of 
the invention is compared to the corresponding unmodified or wild-type 
reverse transcriptase to determine the relative enhancement or increase in 
thermostability. For example, after a heat treatment at 52°C for 5 minutes, a 
thermostable reverse transcriptase may retain approximately 90% of the 
activity present before the heat treatment, whereas a reverse transcriptase that 
is wild-type for thermostability may retain 10% of its original activity. 
Likewise, after a heat treatment at 53 °C for five minutes, a thermostable 
reverse transcriptase may retain approximately 80% of its original activity, 
whereas a reverse transcriptase that is wild-type for thermostability may have 
no measurable activity. Similarly, after a heat treatment at 50°C for five 
minutes, a thermostable reverse transcriptase may retain approximately 50%, 
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approximately 55%, approximately 60%, approximately 65%, approximately 
70%, approximately 75%, approximately 80%, approximately 85%, 
approximately 90%, or approximately 95% of its original activity, whereas a 
reverse transcriptase that is wild-type for thermostability may have no 
measurable activity or may retain 10%, 15% or 20% of its original activity. In 
the first instance {i.e., after heat treatment at 52°C for 5 minutes), the 
thermostable reverse transcriptase would be said to be 9-fold more 
thermostable than the wild-type reverse transcriptase. Examples of conditions 
which may be used to measure thermostability of reverse transcriptases are set 
out below, for example, in the Examples. 

[0111] The thermostability of a reverse transcriptase can be determined by 

comparing the residual activity of a sample of the reverse transcriptase that has 
been subjected to a heat treatment, i.e., incubated at 52°C for a given period of 
time, for example, five minutes, to a control sample of the same reverse 
transcriptase that has been incubated at room temperature for the same length 
of time as the heat treatment. Typically the residual activity may be measured 
by following the incorporation of a radiolabled deoxyribonucleotide into an 
oligodeoxyribonucleotide primer using a complementary oligoribonucleotide 
template. For example, the ability of the reverse transcriptase to incorporate 
[(X- 32 P]-dGTP into an oligo-dG primer using a poly(riboC) template may be 
assayed to determine the residual activity of the reverse transcriptase. 

[0112] In another aspect, thermostable reverse transcriptases of the invention 

may include any reverse transcriptase which is inactivated at a higher 
temperature compared to the corresponding wild-type, unmutated, or 
unmodified reverse transcriptase. Preferably, the inactivation temperature for 
the thermostable reverse transcriptases of the invention is from about 2°C to 
about 50°C (e.g., about 2°C, about 4°C, about 5°C, about 8°C, about 10°C, 
about 12°C, about 14°C, about 16°C, about 18°C, about 20°C, about 24°C, 
about 26°C, about 28°C, about 30°C, about 33°C, about 35°C, about 38°C, 
about 40°C, about 42°C, about 44°C, about 46°C, about 48°C, or about 50°C) 
higher than the inactivation temperature for the corresponding wild-type, 
unmutated, or unmodified reverse transcriptase. More preferably, the 
inactivation temperature for the reverse transcriptases of the invention is from 
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about 5°C to about 50°C, from about 5°C to about 40°C, from about 5°C to 
about 30°C, or from about 5°C to about 25°C greater than the inactivation 
temperature for the corresponding wild-type, unmutated or unmodified reverse 
transcriptase, when compared under the same conditions. 
[0113] The difference in inactivation temperature for the reverse transcriptase 

of the invention compared to its corresponding wild-type, unmutated or 
unmodified reverse transcriptase can be determined by treating samples of 
such reverse transcriptases at different temperatures for a defined time period 
and then measuring residual reverse transcriptase activity, if any, after the 
samples have been heat treated. Determination of the difference or delta in the 
inactivation temperature between the test reverse transcriptase compared to the 
wild-type, unmutated or unmodified control is determined by comparing the 
difference in temperature at which each reverse transcriptase is inactivated 
(i.e., no residual reverse transcriptase activity is measurable in the particular 
assay used). As will be recognized, any number of reverse transcriptase 
assays may be used to determine the different or delta of inactivation 
temperatures for any reverse transcriptases tested. 
[0114] In another aspect, thermostability of a reverse transcriptase of the 

invention may be determined by measuring the half-life of the reverse 
transcriptase activity of a reverse transcriptase of interest. Such half-life may 
be compared to a control or wild-type reverse transcriptase to determine the 
difference (or delta) in half-life. Half-lifes of the reverse transcriptases of the 
invention are preferably determined at elevated temperatures (e.g., greater than 
37°C) and preferably at temperatures ranging from 40°C to 80°C, more 
preferably at temperatures ranging from 45°C to 75°C, 50°C to 70°C, 50°C to 
65°C, and 50°C to 60°C. Preferred half-lifes of the reverse transcriptases of 
the invention may range from 4 minutes to 10 hours, 4 minutes to 7.5 hours, 4 
minutes to 5 hours, 4 minutes to 2.5 hours, or 4 minutes to 2 hours, depending 
upon the temperature used. For example, the reverse transcriptase activity of 
the reverse transcriptases of the invention may have a half-life of at least 4 
minutes, at least 5 minutes, at least 6 minutes, at least 7 minutes, at least 8 
minutes, at least 9 minutes, at least 10 minutes, at least 11 minutes, at least 12 
minutes, at least 13 minutes, at least 14 minutes, at least 15 minutes, at least 
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20 minute, at least 25-minutes, at least 30 minutes, at least 40 minutes, at least 
50 minutes, at least 60 minutes, at least 70 minutes, at least 80 minutes, at 
least 90 minutes, at least 100 minutes, at least 115 minutes, at least 125 
minutes, at least 150 minutes, at least 175 minutes, at least 200 minutes, at 
least 225 minutes, at least 250 minutes, at least 275 minutes, at least 300 
minutes, at least 400 minutes, at least 500 minutes at temperatures of 48°C, 
50°C, 52°C, 52.5°C, 55°C, 57°C, 60°C, 62°C, 65°C, 68°C, and/or 70°C. 

[0115] Terminal extension activity. As used herein, terminal extension 

activity refers to the ability of a reverse transcriptase (RT) to add additional 
bases on to the 3' end of a newly synthesized cDNA strand beyond the 5' end 
of the DNA or mRNA template. Terminal extension activity may add bases 
specifically (witfr a nucleotide bias) of randomly. 

[01 16] Terminal extension activity is also known as terminal deoxynucleotidyl 

transferase (TdT) activity. A reverse transcriptase having reduced TdT 
activity is defined as any reverse transcriptase having lower TdT specific 
activity than the TdT specific activity of the corresponding wild-type, 
unmutated, or unmodified enzyme, for example, less than about 90% of the 
TdT specific activity of the corresponding wild-type, unmutated, or 
unmodified enzyme, less than about 85% of the TdT specific activity of the 
corresponding wild-type, unmutated, or unmodified enzyme, less than about 
80% of the TdT specific activity of the corresponding wild-type, unmutated, 
or unmodified enzyme, less than about 75% of the TdT specific activity of the 
corresponding wild-type, unmutated, or unmodified enzyme, less than about 
50% of the TdT specific activity of the corresponding wild-type, unmutated, 
or unmodified enzyme, less than about 25% of the TdT specific activity of the 
corresponding wild-type, unmutated, or unmodified enzyme, less than about 
15% of the TdT specific activity of the corresponding wild-type, unmutated, 
or unmodified enzyme, less than 10% of the TdT specific activity of the 
corresponding wild-type, unmutated, or unmodified enzyme, less than about 
5% of the TdT specific activity of the corresponding wild-type, unmutated, or 
unmodified enzyme, or less than about 1% of the TdT specific activity of the 
corresponding wild-type, unmutated, or unmodified enzyme. A reverse 
transcriptase of the invention having substantially reduced TdT activity refers 
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to a reverse transcriptase having a TdT specific activity level of 30% or less 
than the TdT specific activity of the corresponding wild-type or TdT + reverse 
transcriptase. Eliminated TdT activity is defined as a level of activity that is 
undetectable by the assay methods set out herein in Example 3. 
[0117] As noted below in Example 3, reverse transcriptases are known in the 

art which extend nucleic acid molecules 2-3 nucleotides past the end of 
templates (e.g., RNA or DNA templates). Further, in any one reaction mixture 
in which reverse transcription occurs, mixtures of molecules may be present 
which contain different numbers of nucleotides that extend beyond the end of 
the template. TdT activity may be determined herein in reference to the 
number or percentage of molecules which contain one or more nucleotides 
which extend beyond the end of the template. For example, if a wild-type 
reverse transcriptase adds 1 or more nucleotides past the end of a template to 
90% of the molecules generated during reverse transcription and a modified 
reverse transcriptase adds 1 or more nucleotides past the end of a template to 
45% of the molecules under the same or similar conditions, then the modified 
reverse transcriptase would be said to exhibit a 50% decrease in TdT activity 
as compared to the wild-type enzyme. Further, an F309N, T306K, H204R 
mutant of M-MLV Superscript™ II has been generated which exhibits about 
0% of the TdT activity exhibited by Superscript™ II when DNA is used as a 
template and about 10-20% of the TdT activity exhibited by SUPERSCRIPT™ II 
when RNA is used as a template. 

[0118] Fidelity. Fidelity refers to the accuracy of polymerization, or the 

ability of the reverse transcriptase to discriminate correct from incorrect 
substrates, (e.g., nucleotides) when synthesizing nucleic acid molecules which 
are complementary to a template. The higher the fidelity of a reverse 
transcriptase, the less the reverse transcriptase misincorporates nucleotides in 
the growing strand during nucleic acid synthesis; that is, an increase or 
enhancement in fidelity results in a more faithful reverse transcriptase having 
decreased error rate or decreased misincorporation rate. 

[0119] A reverse transcriptase having increased/enhanced/higher fidelity is 

defined as a polymerase having any increase in fidelity, preferably about 1.2 to 
about 10,000 fold, about 1.5 to about 10,000 fold, about 2 to about 5,000 fold, 
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or about 2 to about 2000 fold (preferably greater than about 5 fold, more 
preferably greater than about 10 fold, still more preferably greater than about 
50 fold, still more preferably greater than about 100 fold, still more preferably 
greater than about 500 fold and most preferably greater than about 100 fold) 
reduction in the number of misincorporated nucleotides during synthesis of 
any given nucleic acid molecule of a given length compared to the control 
reverse transcriptase. Preferably, the mutant or modified reverse transcriptase 
of the invention is compared to the corresponding unmodified or wild-type 
reverse transcriptase to determine the relative enhancement or increase in 
fidelity. For example, a mutated reverse transcriptase may misincorporate one 
nucleotide in the synthesis of a nucleic acid molecule segment of 1000 bases 
compared to -an unmutated reverse transcriptase misincorporating 10 
nucleotides in the same size segment. Such a mutant reverse transcriptase 
would be said to have an increase of fidelity of 10 fold. 
[0120] Fidelity can also be measured by the decrease in the incidence of frame 

shifting, as described below in Example 5. A reverse transcriptase having 
increased fidelity may be defined as a polymerase or reverse transcriptase 
having any increase in fidelity with respect to frame shifting, as compared to a 
control reverse transcriptase (e.g., a corresponding wild-type and/or a 
corresponding un-mutated or un-modified reverse transcriptase), for example, 
a reverse transcriptase having greater than about 1.2 fold increased fidelity 
with respect to frame shifting, having greater than about 1.5 fold increased 
fidelity with respect to frame shifting, having greater than about 5 fold 
increased fidelity with respect to frame shifting, having greater than about 10 
fold increased fidelity with respect to frame shifting, having greater than about 
20 fold increased fidelity with respect to frame shifting, having greater than 
about 30 fold increased fidelity with respect to frame shifting, or having 
greater than about 40 fold increased fidelity with respect to frame shifting. 
[0121] A reverse transcriptase having increased/enhanced/higher fidelity, with 

respect to frame shifting, can also be defined as a reverse transcriptase or 
polymerase having any increase in fidelity, such as from about 1.5 to about 
10,000 fold, from about 2 to about 5,000 fold, from about 2 to about 2000 
fold, from about 1.5 to about 40 fold, from about 5 to about 40 fold, from 
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about 10 to about-40 fold, from about 20 to about 40 fold, from about 30 to 
about 40 fold, from about 5 to about 30 fold, from about 10 to about 30 fold, 
from about 15 to about 30 fold, from about 20 to about 30 fold, from about 5 
to about 20 fold, from about 10 to about 20 fold, from about 15 to about 20 
fold, from about 10 to about 100 fold, from about 15 to about 100 fold, from 
about 20 to about 100 fold, from about 30 to about 100 fold, or from about 50 
to about 100 fold increased fidelity with respect to frame shifting. 

A reverse transcriptase having reduced misincorporation is defined 
herein as either a mutated or modified reverse transcriptase that has about or 
less than 90%, has about or less than 85%, has about or less than 75%, has 
about or less than 70%, has about or less than 60%, or preferably has about or 
less than 50%, preferably has about or less than 25%, more preferably has 
about or less than 10%, and most preferably has about or less than 1% of 
relative misincorporation compared to the corresponding wild-type, 
unmutated, or unmodified enzyme. 

The fidelity or misincorporation rate of a reverse transcriptase can be 
determined by sequencing or by other methods known in the art (Eckert & 
Kunkel, 1990, Nucl Acids Res. 75:3739-3744). In one example, the sequence 
of a DNA molecule synthesized by the unmutated and mutated reverse 
transcriptases can be compared to the expected (known) sequence. In this 
way, the number of errors (misincorporation or frame shifts) can be 
determined for each enzyme and compared. In another example, the 
unmutated and mutated reverse transcriptases may be used to sequence a DNA 
molecule having a known sequence. The number of sequencing errors 
(misincorporation or frame shifts) can be compared to determine the fidelity or 
misincorporation rate of the enzymes. Other means of determining the fidelity 
or misincorporation rate include a forward complementation assay using an 
RNA template as described below and previously in Boyer J.C. et al Methods 
Enzymol. 275:523 (1996), and are set out in the examples. Other methods of 
determining the fidelity or misincorporation rate will be recognized by one of 
skill in the art. 

Strand jumping. Strand jumping, as used herein, refers to a type of 
random mutation caused by an reverse transcriptase "skipping" more than one 
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(e.g. f two, five,- ten, fifty, -one-hundred, etc.) nucleotides on the mRNA 
template, resulting in a deletion of the corresponding nucleotides in the 
resulting cDNA. Sequencing the synthesized nucleic acid molecule and 
comparing to the expected sequence may allow determination of the level or 
amount of strand jumping for the reverse transcriptases of the invention. This 
level or amount may then be compared to the level or amount of strand 
jumping caused by the corresponding wild type and/or un-modified or un- 
mutated reverse transcriptase. 

[0125] Hand domain. The hand domain, as used herein, refers to those amino 

acids which are in the area or areas that control the template, primer, or 
nucleotide interaction of the reverse transcriptase. This domain is further 
characterized by a group of three regions of secondary structure in a reverse 
transcriptase enzyme, the thumb, fingers and palm regions. The thumb region 
is defined as residing between amino acids 240-315 of HIV reverse 
transcriptase, or between amino acids 280-355 of M-MLV reverse 
transcriptase. The fingers region is defined as residing between amino acids 
1-85 and 120-154 of HIV reverse transcriptase, or between 1-124 and 161-193 
of M-MLV reverse transcriptase. The palm region is defined as residing 
between amino acids 86-199 and 155-239 of HIV reverse transcriptase, or 
between amino acids 126-160 and 193-279 of M-MLV reverse transcriptase. 
These areas are generally defined, and the amino acids defining the N-termini 
and C-termini are approximate. Corresponding regions may also be defined 
for other reverse transcriptases. Preferred reverse transcriptases of the 
invention have one or more modifications or mutations within the hand 
domain. More particularly, reverse transcriptases of the invention comprise 
one or more mutations or modifications within one or more regions, including 
the thumb, finger, and palm regions. 

[0126] Full length. As used herein, full length when used to describe a 

product molecule, e.g., a cDNA molecule, indicates that the product molecule 
is the same length or substantially the same length as the template molecule, 
e.g., an mRNA molecule, from which it is produced by the activity of 
polypeptides of the invention. A cDNA molecule may be substantially the 
same length as the template from which it is copied when it is about 90%, 
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91%, 92%, 93.%, 94%, 95%, 96%, 97%, 98%, 99% or greater of the length of 
the portion of the template located 3' to the 3 1 most nucleotide of the primer 
used to reverse transcribe the template. Thus, if a primer anneals in the center 
of a template, a full length product would be one that contains a cDNA copy 
of half of the template. Template molecules may be from about 100 bases to 
about 50 kb in length, from about 200 bases to about 50 kb in length, from 
about 300 bases to about 50 kb in length, from about 400 bases to about 50 kb 
in length, from about 500 bases to about 50 kb in length, from about 600 bases 
to about 50 kb in length, from about 700 bases to about 50 kb in length, from 
about 800 bases to about 50 kb in length, from about 900 bases to about 50 kb 
in length, and from about 1 kb to about 50 kb in length. In some 
embodiments, template molecules may be from about 500 bases to about 10 
kb in length, from about 600 bases to about 10 kb in length, from about 700 
bases to about 10 kb in length, from about 800 bases to about 10 kb in length, 
from about 900 bases to about 10 kb in length, from about 1000 bases to about 
10 kb in length, from about 1 100 bases to about 10 kb in length, and/or from 
about 1200 bases to about 10 kb in length. In some embodiments, template 
molecules may be from about 250 bases to about 5 kb in length, from about 
300 bases to about 5 kb in length, and from about 350 bases to about 5 kb in 
length, from about 400 bases to about 5 kb in length, from about 450 bases to 
about 5 kb in length, from about 500 bases to about 5 kb in length, from about 
550 bases to about 5 kb in length, from about 600 bases to about 5 kb in 
length, from about 650 bases to about 5 kb in length, from about 700 bases to 
about 5 kb in length, from about 750 bases to about 5 kb in length, from about 
800 bases to about 5 kb in length, and from about 850 bases to about 5 kb in 
length. 

In some embodiments, the ability of a reverse transcriptase to 
synthesize a full length product may be determined using a defined template 
and primer, for example, a polyadenylated template corresponding to the 
chloramphenicol acetyl transferase gene and an oligo(dT) primer, under 
defined reaction conditions, e.g., pH, salt concentration, divalent metal 
concentration, template concentration, temperature, etc. In some 
embodiments, a template molecule is greater than about 500 base pairs in 
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length, and the amount of full length product synthesized may determined by 
separating full length product from truncated product, for example, by gel 
electrophoresis, and quantifying the full length product, for example, by 
incorporating a radiolabel in to the product and using a scintillation counter. 
[0128] About. The term "about" as used herein, means the recited number 

plus or minus 10%. Thus, "about 100" includes 90-1 10. 

Overview 

[0129] In general, the invention provides, in part, compositions for use in 

reverse transcription of a nucleic acid molecule comprising a reverse 
transcriptase with one or more (e.g., one, two, three, four, five, ten, twelve, 
fifteen, twenty, thirty, etc.) mutations or modification which render the reverse 
transcriptase more thermostable. The invention also provides compositions 
for use in reverse transcription of a nucleic acid molecule, the compositions 
comprising a reverse transcriptase with one or more mutations or modification 
which render the reverse transcriptase more efficient, that is having higher 
fidelity, and/or has lower TdT activity than a corresponding un-mutated or un- 
modified reverse transcriptase. The invention further provides compositions 
comprising a reverse transcriptase with one or more mutations or modification 
which render the reverse transcriptase more thermostable and/or more efficient 
than a corresponding un-mutated or un-modified reverse transcriptase. 
[0130] The enzymes in these compositions are preferably present in working 

concentrations and are also preferably reduced, substantially reduced, or 
eliminated in RNase H activity. Alternatively, reverse transcriptases used in 
the compositions of the invention may have RNase H activity. Preferred 
reverse transcriptases include retroviral reverse transcriptases such as M-MLV 
reverse transcriptase, HIV reverse transcriptase, RSV reverse transcriptase, 
AMV reverse transcriptase, RAV reverse transcriptase, and MAV reverse 
transcriptase or other ASLV reverse transcriptases or their corresponding 
RNase H- derivatives. Additional reverse transcriptases which may be used to 
prepare compositions of the invention include bacterial reverse transcriptases 
(e.g., Escherichia coli reverse transcriptase) (see, e.g., Mao et al, Biochem. 
Biophys. Res. Commun. 227:489-93 (1996)) and reverse transcriptases of 



-56- 

Saccharomyces cerevisiae (e.g., reverse transcriptases of the Tyl or Ty3 
retrotransposons) (see, e.g., Cristofari et aL, Jour. Biol. Chem. 
274:36643-36648 (1999); Mules et aL, Jour. Virol. 72:6490-6503 (1998)). 

In accordance with the invention, any number of mutations can be 
made to the reverse transcriptases and, in a preferred aspect, multiple 
mutations can be made to result in an increased thermostability and/or to 
confer other desired properties on reverse transcriptases of the invention. 
Such mutations include point mutations, frame shift mutations, deletions and 
insertions, with one or more (e.g., one, two, three, four, five, ten, twelve, 
fifteen, twenty, thirty, etc.) point mutations preferred. Mutations may be 
introduced into reverse transcriptases of the present invention using any 
methodology kno\Vn to those of skill in the art. Mutations may be introduced 
randomly by, for example, conducting a PCR reaction in the presence of 
manganese as a divalent metal ion cofactor. Alternatively, oligonucleotide 
directed mutagenesis may be used to create the mutant polymerases which 
allows for all possible classes of base pair changes at any determined site 
along the encoding DNA molecule. In general, this technique involves 
annealing an oligonucleotide complementary (except for one or more 
mismatches) to a single stranded nucleotide sequence coding for the reverse 
transcriptase of interest. The mismatched oligonucleotide is then extended by 
DNA polymerase, generating a double-stranded DNA molecule which 
contains the desired change in sequence in one strand. The changes in 
sequence can, for example, result in the deletion, substitution, or insertion of 
an amino acid. The double-stranded polynucleotide can then be inserted into 
an appropriate expression vector, and a mutant or modified polypeptide can 
thus be produced. The above-described oligonucleotide directed mutagenesis 
can, for example, be carried out via PCR. 

The invention is also directed to methods for reverse transcription of 
one or more (e.g., one, two, three, four, five, ten, twelve, fifteen, twenty, etc.) 
nucleic acid molecules comprising mixing one or more (e.g., one, two, three, 
four, five, ten, twelve, fifteen, twenty, etc.) nucleic acid templates, which are 
preferably RNA or messenger RNA (mRNA) and more preferably a 
population of mRNA molecules, with one or more reverse transcriptase of the 
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present indention and. incubating the mixture under conditions sufficient to 
make a nucleic acid molecule or molecules complementary to all or a portion 
of the one or more (eg,, one, two, three, four, five, ten, twelve, fifteen, twenty, 
thirty, etc.) templates. To make the nucleic acid molecule or molecules 
complementary to the one or more templates, a primer (e.g., an oligo(dT) 
primer) and one or more nucleotides are preferably used for nucleic acid 
synthesis in the 5* to 3' direction. Nucleic acid molecules suitable for reverse 
transcription according to this aspect of the invention include any nucleic acid 
molecule, particularly those derived from a prokaryotic or eukaryotic cell. 
Such cells may include normal cells, diseased cells, transformed cells, 
established cells, progenitor cells, precursor cells, fetal cells, embryonic cells, 
bacterial, cells, yeast cells, animal cells (including human cells), avian cells, 
plant cells and the like, or tissue isolated from a plant or an animal (e.g., 
human, cow, pig, mouse, sheep, horse, monkey, canine, feline, rat, rabbit, bird, 
fish, insect, etc.). Nucleic acid molecules suitable for reverse transcription 
may also be isolatejd and/or obtained from viruses and/or virally infected cells. 
[0133] The invention further provides methods for amplifying or sequencing a 

nucleic acid molecule comprising contacting the nucleic acid molecule with a 
reverse transcriptase of the present invention. Preferred such methods 
comprise one or more polymerase chain reactions (PCRs). 

Sources of Reverse Transcriptases 

[0134] Enzymes for use in compositions, methods and kits of the invention 

include any enzyme having reverse transcriptase activity. Such enzymes 
include, but are not limited to, retroviral reverse transcriptase, retrotransposon 
reverse transcriptase, hepatitis B reverse transcriptase, cauliflower mosaic 
virus reverse transcriptase, bacterial reverse transcriptase, Tth DNA 
polymerase, Tag DNA polymerase (Saiki, R.K., et al., Science 239:487-491 
(1988^; U.S. Patent Nos. 4,889,818 and 4,965,188), Tne DNA polymerase 
(PCT Publication No. WO 96/10640), Tma DNA polymerase (U.S. Patent No. 
5,374,553) and mutants, fragments, variants or derivatives thereof {see, e.g., 
commonly owned U.S. Patent Nos. 5,948,614 and 6,015,668, which are 
incorporated by reference herein in their entireties). Preferably, reverse 
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transcriptases for use in the invention include retroviral reverse transcriptases 
such as M-MLV reverse transcriptase, AMV reverse transcriptase, RSV 
reverse transcriptase, RAV reverse transcriptase, MAV reverse transcriptase, 
and generally ASLV reverse transcriptases. As will be understood by one of 
ordinary skill in the art, modified reverse transcriptases may be obtained by 
recombinant or genetic engineering techniques that are routine and well- 
known in the art. Mutant reverse transcriptases can, for example, be obtained 
by mutating the gene or genes encoding the reverse transcriptase of interest by 
site-directed or random mutagenesis. Such mutations may include point 
mutations, deletion mutations and insertional mutations. For example, one or 
more point mutations (e.g. , substitution of one or more amino acids with one 
or more different amino acids) may be used to construct mutant reverse 
transcriptases of the invention. 
[0135] The invention further includes reverse transcriptases which are 70%, 

75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical at the amino 
acid level to a wild-type reverse transcriptase (e.g., M-MLV reverse 
transcriptase, AMV reverse transcriptase, RSV reverse transcriptase, HIV 
reverse transcriptase, etc.) and exhibit increased thermostability and/or other 
desired properties of the invention. Also included within the invention are 
reverse transcriptases which are 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 
98%, or 99% identical at the amino acid level to a reverse transcriptase 
comprising the amino acid sequence set out below in Table 3 (SEQ ID NO:2) 
and exhibit increased thermostability and/or more efficient (e.g., having higher 
fidelity and/or having lower TdT activity). The invention also includes 
nucleic acid molecules which encode the above described reverse 
transcriptases. 

[0136] The invention also includes fragments of reverse transcriptases which 

comprise at least 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, or 700 
amino acid residues and retain one or more activities associated with reverse 
transcriptases. Such fragments may be obtained by deletion mutation, by 
recombinant techniques that are routine and well-known in the art, or by 
enzymatic digestion of the reverse transcriptase(s) of interest using any of a 
number of well-known proteolytic enzymes. The invention further includes 
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nucleic acid molecules which encode the above described mutant reverse 
transcriptases and reverse transcriptase fragments. 
[0137] Reverse transcriptase fragments of the invention also comprise amino 

acids 1-10, 11-20, 21-30, 31-40, 41-50, 51-60, 61-70, 71-80, 81-90, 91-100, 
101-110, 111-120, 121-130, 131-140, 141-150, 151-160, 161-170, 171-180, 
181-190, 191-200, 201-210, 211-220, 221-230, 231-240, 241-250, 251-260, 
261-270, 271-280, 281-290, 291-300, 301-310, 311-320, 321-330, 331-340, 
341-350, 351-360, 361-370, 371-380, 381-390, 391-400, 401-410, 411-420, 
421-430, 431-440, 441-450, 451-460, 461-470, 471-480, 481-490, 491-500, 
501-510, 511-520, 521-530, 531-540, and/or 541-550 and/or amino acids 1- 
355, 1-498, 1-500, and/or 1-550 of M-MLV reverse transcriptase (and more 
preferably the sequence shown in Table 3, which may further contain one or 
more of the modifications or mutations discussed herein), as well as 
corresponding fragments of other reverse transcriptases. Reverse transcriptase 
fragments of the invention further comprise polypeptides which are 70%, 
75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical to one or more 
of the fragments set out above. The invention also concerns various 
combinations of any number of these fragments. 
[0138] By a protein or protein fragment having an amino acid sequence at 

least, for example, 70% "identical" to a reference amino acid sequence it is 
intended that the amino acid sequence of the protein is identical to the 
reference sequence except that the protein sequence may include up to 30 
amino acid alterations per each 100 amino acids of the amino acid sequence of 
the reference protein. In other words, to obtain a protein having an amino acid 
sequence at least 70% identical to a reference amino acid sequence, up to 30% 
of the amino acid residues in the reference sequence may be deleted or 
substituted with another amino acid, or a number of amino acids up to 30% of 
the total amino acid residues in the reference sequence may be inserted into 
the reference sequence. These alterations of the reference sequence may occur 
at the amino (N-) and/or carboxy (C-) terminal positions of the reference 
amino acid sequence and/or anywhere between those terminal positions, 
interspersed either individually among residues in the reference sequence 
and/or in one or more contiguous groups within the reference sequence. As a 
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practical matter, whether a given amino acid sequence is, for example, at least 
70% identical to the amino acid sequence of a reference protein can be 
determined conventionally using known computer programs such as those 
described above for nucleic acid sequence identity determinations, or using the 
CLUSTAL W program (Thompson, J.D., et al. t Nucleic Acids Res. 22:4673- 
4680(1994)). 

[0139] Sequence identity may be determined by comparing a reference 

sequence or a subsequence of the reference sequence to a test sequence. The 
reference sequence and the test sequence are optimally aligned over an 
arbitrary number of residues termed a comparison window. In order to obtain 
optimal alignment, additions or deletions, such as gaps, may be introduced 
into the test: sequence. " The percent sequence identity is determined by 
determining the number of positions at which the same residue is present in 
both sequences and dividing the number of matching positions by the total 
length of the sequences in the comparison window and multiplying by 100 to 
give the percentage. In addition to the number of matching positions, the 
number and size of gaps is also considered in calculating the percentage 
sequence identity. 

[0140] Sequence identity is typically determined using computer programs. A 

representative program is the BLAST (Basic Local Alignment Search Tool) 
program publicly accessible at the National Center for Biotechnology 
Information (NCBI, http://www.ncbi.nlm.nih.gov/). This program compares 
segments in a test sequence to sequences in a database to determine the 
statistical significance of the matches, then identifies and reports only those 
matches that that are more significant than a threshold level. A suitable 
version of the BLAST program is one that allows gaps, for example, version 
2.X (Altschul, et al. 9 Nucleic Acids Res. 25(17):3389-402, 1997). Standard 
BLAST programs for searching nucleotide sequences (blastn) or protein 
(blastp) may be used. Translated query searches in which the query sequence 
is translated, i.e., from nucleotide sequence to protein (blastx) or from protein 
to nucleic acid sequence (tbblastn) may also be used as well as queries in 
which a nucleotide query sequence is translated into protein sequences in all 6 
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reading frames and then compared to an NCBI nucleotide database which has 
been translated in all six reading frames (tbblastx). 

[0141] Additional suitable programs for identifying proteins with sequence 

identity to the proteins of the invention include, but are not limited to, PHI- 
BLAST (Pattern Hit Initiated BLAST, Zhang, et aL, Nucleic Acids Res. 
26(17):3986-90, 1998) and PSI-BLAST (Position-Specific Iterated BLAST, 
Altschul, etal, Nucleic Acids Res. 25(17):3389-402, 1997). 

[0142] Programs may be used with default searching parameters. 

Alternatively, one or more search parameter may be adjusted. Selecting 
suitable search parameter values is within the abilities of one of ordinary skill 
in the art. 

[0143] Preferred enzymes for use in the invention include those that are 

reduced, substantially reduced, or lacking in RNase H activity. Such enzymes 
that are reduced or substantially reduced in RNase H activity may be obtained 
by mutating, for example, the RNase H domain within the reverse 
transcriptase of interest, for example, by introducing one or more {e.g. , one, 
two, three, four, five, ten, twelve, fifteen, twenty, thirty, etc.) point mutations, 
one or more {e.g., one, two, three, four, five, ten, twelve, fifteen, twenty, 
thirty, etc.) deletion mutations, and/or one or more {e.g., one, two, three, four, 
five, ten, twelve, fifteen, twenty, thirty, etc.) insertion mutations as described 
above. In some embodiments, the reverse transcriptase of the invention does 
not contain a modification or mutation in the RNase H domain and preferably 
does not contain a modification which reduces RNase H activity. In one 
aspect, the reverse transcriptase of the invention has 90%, 95%, or 100% of 
the RNase H activity compared to the corresponding wildtype reverse 
transcriptase. 

[0144] By an enzyme "substantially reduced in RNase H activity" is meant 

that the enzyme has less than about 30%, less than about 25%, less than about 
20%, more preferably less than about 15%, less than about 10%, less than 
about 7.5%, or less than about 5%, and most preferably less than about 5% or 
less than about 2%, of the RNase H activity of the corresponding wild-type or 
RNase H + enzyme, such as wild-type Moloney Murine Leukemia Virus (M- 
MLV), Avian Myeloblastosis Virus (AMV) or Rous Sarcoma Virus (RSV) 
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re verse -transcriptases. A reduction in RNase H activity means any reduction 
in the activity compared, for example, to the corresponding wild type or un- 
mutatated or un-modified reverse transcriptase. 

[01451 Reverse transcriptases having reduced, substantially reduced, 

undetectable or lacking RNase H activity have been previously described (see 
U.S. Patent 5,668,005, U.S. Patent 6,063,608, and PCT Publication No. WO 
98/47912). The RNase H activity of any enzyme may be determined by a 
variety of assays, such as those described, for example, in U.S. Patent No. 
5,244,797, in Kotewicz, M.L., et al. 9 Nucl Acids Res. 16:265 (1988), in 
Gerard, G.F., et al, FOCUS 14(5):9\ (1992), in PCT publication number WO 
98/47912, and in U.S. Patent No. 5,668,005, the disclosures of all of which are 
fully incorporated herein by reference. Reverse transcriptases having no 
detectable RNase H activity or lacking RNase H activity by one or more of the 
described assays are particularly preferred. 

[0146] Particularly preferred enzymes for use in the invention include, but are 

not limited to, M-MLV RNase H- reverse transcriptase, RSV RNase H- 
reverse transcriptase, AMV RNase H- reverse transcriptase, RAV RNase El- 
reverse transcriptase, MAV RNase H- reverse transcriptase and HIV RNase 
H- reverse transcriptase. It will be understood by one of ordinary skill, 
however, that any enzyme capable of producing a DNA molecule from a 
ribonucleic acid molecule (i.e., an enzyme having reverse transcriptase 
activity) that is reduced, substantially reduced, or lacking in RNase H activity 
may be equivalently used in the compositions, methods and kits of the 
invention. 

[0147] Enzymes for use in the invention also include those in which terminal 

deoxynucleotidyl transferase (TdT) activity has been reduced, substantially 
reduced, or eliminated. Such enzymes that are reduced or substantially 
reduced in terminal deoxynucleotidyl transferase activity, or in which TdT 
activity has been eliminated, may be obtained by mutating, for example, 
amino acid residues within the reverse transcriptase of interest which are in 
close proximity or in contact with the template-primer, for example, by 
introducing one or more (e.g., one, two, three, four, five, ten, twelve, fifteen, 
twenty, thirty, etc.) point mutations, one or more deletion mutations, and/or 
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one or. more insertion mutations. Reverse transcriptases which exhibit 
decreased TdT activity are described in U.S. Appl. No. 09/808,124, filed 
March 15, 2001 (the entire disclosure of which is incorporated herein by 
reference), and include reverse transcriptases with one or more alterations at 
amino acid positions equivalent or corresponding to Y64, M289, F309, T197 
and/or Y133 of M-MLV reverse transcriptase. 

[0148] In one aspect, amino acid substitutions are made at one or more of the 

above identified positions (i.e., amino acid positions equivalent or 
corresponding to Y64, M289, F309, T197 or Y133 of M-MLV reverse 
transcriptase). Thus, the amino acids at these positions may be substituted 
with any other amino acid including Ala, Arg, Asn, Asp, Cys, Gin, Glu, Gly, 
His, He, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr, and Val. Specific 
example of reverse transcriptases which exhibit reduced, substantially 
reduced, or eliminated TdT activity include M-MLV reverse transcriptases 
(e.g., Superscript™ II) in which (1) the phenylalanine residue at position 
309 has been replaced with asparagine, (2) the threonine residue at position 
197 has been replaced with either alanine or glutamic acid, and/or (3) the 
tyrosine residue at position 133 has been replaced with alanine. 

[0149] Enzymes for use in the invention also include those that exhibit 

increased fidelity. Reverse transcriptases which exhibit increased fidelity are 
described in U.S. Appl. No. 60/189,454, filed March 15, 2000, and U.S. Appl. 
No. 09/808,124, filed March 15, 2001 (the entire disclosures of each of which 
are incorporated herein by reference), and include reverse transcriptases with 
one or more alterations at positions equivalent or corresponding to those set 
out below in Table 2. 



Table 2 


RT 


Amino Acid 


M-MLV 


Y64 (e.g., Y64W, Y64R), R116 (e.g., R116M), K152 (e.g., K152R), 
Q190 (e.g., Q190F), T197 (e.g., T197A, T197E), V223 (e.g., V223H, 
V223I, V223F), D124, H126, Y133 (e.g., Y133A, Y133H), F309 (e.g., 
F309N, F309R) 


AMV 


W25, R76, Kl 10, Q149, T156, M182 


RSV 


W25, R76, Kl 10, Q149, T156, M182 


HIV 


W24, R78, G112, Q151, A158, M184 
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[0150] In soma embodiments of the invention, amino acid substitutions are 

made at one or more of the above identified positions. Thus, the amino acids 
at these positions may be substituted with any other amino acid including Ala, 
Arg, Asn, Arg, Asp, Cys, Gin, Glu, Gly, His, He, Leu, Lys, Met, Phe, Pro, Ser, 
Thr, Trp, Tyr, and Val. Specific example of reverse transcriptases which 
exhibit increased fidelity include M-MLV reverse transcriptase in which (1) 
the valine residue at position 223 has been replaced with histidine, 
phenylalanine or isoleucine, (2) the arginine residue at position 116 has been 
replaced with methionine, (3) the lysine residue at position 152 has been 
replaced with arginine, (4) the glutamic acid residue at position 190 has been 
replaced with phenylalanine, (5) the threonine residue at position 197 has been 
replaced with alanine- or glutamic acid, (6) the phenylalanine residue at 
position 309 has been replaced with asparagine or arginine, (7) the tyrosine 
residue at position 133 has been replaced with histidine or alanine, and/or (8) 
the tyrosine residue at position 64 has been replaced with tryptophan or 
arginine. 

[0151] Thus, in specific embodiments, the invention includes reverse 

transcriptases which exhibit increased thermostability and, optionally, also 
exhibit one or more of the following characteristics: (1) reduced or 
substantially reduced RNase H activity, (2) reduced or substantially reduced 
TdT activity, and/or (3) increased fidelity. 

[0152] The invention also relates to reverse transcriptase mutants, where the 

mutations or substitutions have been made in a recognized region of the 
reverse transcriptase enzyme. Such regions include, but are not limited to, the 
fingers, palm, thumb, ot-helix H, p-sheet 10, and/or P-sheet 11 regions. Thus, 
the invention includes reverse transcriptases which exhibit increased 
thermostability (as well as other properties), as described elsewhere herein, 
and have one or more (e.g., one, two, three, four, five, ten, fifteen, etc.) 
mutations or modification in the hand domain and, more specifically, in one or 
more regions including the fingers, palm and/or thumb regions. 
[0153] Polypeptides having reverse transcriptase activity for use in the 

invention may be isolated from their natural viral or bacterial sources 
according to standard procedures for isolating and purifying natural proteins 
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that are well-known to one of ordinary skill in the art (see, e.g., Houts, G.E., et 
al % J. Virol. 29:517 (1979); U.S. patent number 5,668,005; and PCT 
publication number WO 98/47912). In addition, polypeptides having reverse 
transcriptase activity may be prepared by recombinant DNA techniques that 
are familiar to one of ordinary skill in the art (see, e.g., Kotewicz, M.L., et al, 
NucL Acids Res. 76:265 (1988); Soltis, D.A., and Skalka, A.M., Proc. Natl 
Acad. Set USA 35:3372-3376 (1988)); U.S. patent no. 5,668,005; and PCT 
publication no. WO 98/47912. 
[0154] In one aspect of the invention, mutant or modified reverse 

transcriptases are made by recombinant techniques. A number of cloned 
reverse transcriptase genes are available or may be obtained using standard 
recombinant techniques (see U.S. Patent No. 5,668,005 and PCT Publication 
No. WO 98/47912). 

[0155] To clone a gene or other nucleic acid molecule encoding a reverse 

transcriptase which will be modified in accordance with the invention, isolated 
DNA which contains the reverse transcriptase gene or open reading frame may 
be used to construct a recombinant DNA library. Any vector, well known in 
the art, can be used to clone the reverse transcriptase of interest. However, the 
vector used must be compatible with the host in which the recombinant vector 
will be transformed. 

[0156] Prokaryotic vectors for constructing the plasmid library include 

plasmids such as those capable of replication in E. coli such as, for example, 
pBR322, ColEl, pSClOl, pUC-vectors (pUC18, pUC19, etc.: In: Molecular 
Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, New York (1982); and Sambrook et al., In: Molecular Cloning 
A Laboratory Manual (2d ed.) Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, New York (1989)). Bacillus plasmids include pC194, 
pUBl 10, pE194, pC221, pC217, etc. Such plasmids are disclosed by Glyczan, 
T. In: The Molecular Biology Bacilli, Academic Press, York (1982), 307-329. 
Suitable Streptomyces plasmids include pi J 101 (Kendall et al, J. Bacteriol 
769:4177-4183 (1987)). Pseudomonas plasmids are reviewed by John et al, 
(Rad. Insea Dis. 5:693-704 (1986)), and Igaki, (Jpn. J. Bacteriol 55:729-742 
(1978)). Broad-host range plasmids or cosmids, such as pCP13 (Darzins and 
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Chakrabarty,/. Bacteriol 159:9-1% (1984)) can also be used for the present 
invention. Preferred vectors for cloning the genes and nucleic acid molecules 
of the present invention are prokaryotic vectors. Preferably, pBAD, pCP13 
and pUC vectors are used to clone the genes of the present invention. Other 
suitable vectors are known to those skilled in the art and are commercially 
available, for example, from Invitrogen Corporation, Carlsbad, CA. 

[0157] Suitable host for cloning the reverse transcriptase genes and nucleic 

acid molecules of interest are prokaryotic hosts. One example of a prokaryotic 
host is E. coli. However, the desired reverse transcriptase genes and nucleic 
acid molecules of the present invention may be cloned in other prokaryotic 
hosts including, but not limited to, hosts in the genera Escherichia, Bacillus, 
Streptomyces, Pseudomonas, Salmonella, Serratia, and Proteus. Bacterial 
hosts of particular interest include E. coli DH10B, which may be obtained 
from Invitrogen Corporation (Carlsbad, CA). 

[0158] Eukaryotic hosts for cloning and expression of the reverse transcriptase 

of interest include yeast, fungal, and mammalian cells. Expression of the 
desired reverse transcriptase in such eukaryotic cells may require the use of 
eukaryotic regulatory regions which include eukaryotic promoters. Cloning 
and expressing the reverse transcriptase gene or nucleic acid molecule in 
eukaryotic cells may be accomplished by well known techniques using well 
known eukaryotic vector systems. 

[0159] Once a DNA library has been constructed in a particular vector, an 

appropriate host is transformed by well known techniques. Transformed cells 
are plated at a density to produce approximately 200-300 transformed colonies 
per petri dish. For selection of reverse transcriptase, colonies are then 
screened for the expression of a reverse transcriptase or a thermostable reverse 
transcriptase as described in the Examples below. Briefly, overnight cultures 
of individual transformant colonies are assayed directly for reverse 
transcriptase or thermostable reverse transcriptase activity and/or other 
desirable activities using a labeled deoxynucleotide and analyzed for the 
presence of labeled product. If thermostable reverse transcriptase activity 
and/or other desirable activity is detected, the mutant is sequenced to 
determine which amino acids maintained reverse transcriptase activity. The 
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gene or nucleic acid molecule encoding a reverse transcriptase of the present 
invention can be cloned using techniques known to those in the art. 

Modifications or Mutations of Polymerases 

In accordance with the invention, one or more mutations may be made 
in any reverse transcriptase in order to increase the thermostability of the 
enzyme, or confer other properties described herein upon the enzyme, in 
accordance with the invention. Such mutations include point mutations, frame 
shift mutations, deletions and insertions. Preferably, one or more point 
mutations, resulting in one or more amino acid substitutions, are used to 
produce reverse transcriptases having enhanced or increased thermostability. 
In a preferred aspect of the invention, one or more mutations at positions 
equivalent or corresponding to position H204 (e.g., H204R) and/or T306 (e.g., 
T306K or T306R) of M-MLV reverse transcriptase may be made to produced 
the desired result in other reverse transcriptases of interest. 

In specific embodiments, one or more mutations at positions equivalent 
or corresponding to position L52, Y64, Rl 16, Y133, K152 Q190, T197, H204, 
V223, M289, T306 and/or F309 of M-MLV reverse transcriptase may be 
made to produced a desired result (e.g., increased thermostability, increased 
fidelity, decreased TdT activity, etc.). Thus, in specific embodiments, using 
amino acid positions of M-MLV reverse transcriptase as a frame of reference, 
proteins of the invention include reverse transcriptases (e.g., M-MLV reverse 
transcriptase, AMV reverse transcriptase, HIV reverse transcriptase, RSV 
reverse transcriptase, etc.) having one or more of the following alterations: 
L52P, Y64S, Y64W, Y64R, R116M, Y133A, Y133H, K152R, K152M, 
Q190F, T197R, T197E, T197A, T197K, H204R, V223H, V223F, V223I, 
M289L, T306K, T306R, F309R, and/or F309N, as well as compositions 
containing these proteins, nucleic acid molecules which encode these proteins, 
and host cells which contain these nucleic acid molecules. 

Mutations in reverse transcriptases which alter thermostability 
properties of these proteins may be present in conjunction with alterations 
which either have little or no effect on activities normally associated with 
reverse transcriptases (e.g., RNase H activity, reverse transcriptase or 
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polymerase- activity, terminal deoxynucleotidyl transferase (TdTase) activity, 
etc.) or substantially alter one or more activities normally associated with 
reverse transcriptases. One example of a reverse transcriptase which has such 
a combination of mutations is a M-MLV reverse transcriptase which has the 
following alterations: K152M, V223H. 

[0163] One mutation which has been shown to enhanced the fidelity of 

Superscript™ II (Invitrogen Corporation (Carlsbad, CA) Catalog No. 
18064-022) is V223H (see U.S. Appl. No. 60/189,454, filed March 15, 2000, 
U.S. Appl. No. 09/808,124, filed March 15, 2001, and PCT publication 
number WO 01/68895, the entire disclosures of each of which are 
incorporated herein by reference). However, the V223H alteration decreases 
the thermostability of this enzyme. One mutant was identified, K152M, which 
suppress the destabilizing effect of enzymes having the V223H mutation. 
Thus, the invention includes M-MLV reverse transcriptase which contain 
alterations at positions K152 and V223 and exhibit both increased fidelity and 
increased thermostability. Specific examples of such reverse transcriptases are 
those in which K152 is replaced with methionine and V223 is replaced with 
histidine. Other reverse transcriptases (e.g., AMV reverse transcriptase, HIV 
reverse transcriptase, RSV reverse transcriptase, etc.) with corresponding 
alterations are also included within the scope of the invention. 

[0164] Superscript™ II is an RNase H- reverse transcriptase from M-MLV 

which has the following substitutions: D524G, E562Q, and D583N (see U.S. 
Patent Nos. 5,017,492, 5,244,797, 5,405,776, 5,668,005, and 6,063,608, the 
entire disclosures of which are incorporated herein by reference). The 
invention includes reverse transcriptases that contain alterations, such as those 
reference positions (i.e., 524, 562, and/or 583)or equivalent positions. 

[01651 One or more amino acid substitutions are made at one or more selected 

positions for any reverse transcriptase of interest. Thus, the amino acids at the 
selected positions may be substituted with any other amino acid including Ala, 
Arg, Asn, Asp, Cys, Gin, Glu, Gly, His, He, Leu, Lys, Met, Phe, Pro, Ser, Thr, 
Tip, Tyr, and Val. In some preferred embodiments, the selected amino acid 
will be a non-charged surface residue and will be replaced by a charged 
residue. In some preferred embodiments, the non-charged surface residue may 
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, be replaced- by a positively charged amino acid (e.g., lysine or arginine). In 
one aspect, a charged residue will be replaced with an un-charged residue. In 
one aspect, a non-charged residue will be replaced with a negatively charged 
residue. In another aspect, a negatively charged residue will be replaced with 
a positively charged residue and/or a positively charged residue will be 
replaced with a negatively charged residue. 

[0166] The corresponding positions of M-MLV reverse transcriptase 

identified above may be readily identified for other reverse transcriptases by 
one with skill in the art. Thus, given the defined region and the assays 
described in the present application, one with skill in the art can make one or a 
number of modifications which would result in increased thermostability 
and/or other desired features of any reverse transcriptase of interest. Residues 
to be modified in accordance with the present invention may include those 
listed in Table 1 above. 

[0167] The nucleotide sequences for M-MLV reverse transcriptase (Shinnick 

et al. 9 1981, Nature 2P3:543-548; Georgiadis et al., 1995, Structure 3:879- 
892), AMV reverse transcriptase (Joliot et al., 1993, Virology 7P5:812-819), 
RSV reverse transcriptase (Schwartz et al, 1983, Cell 32:853-859), and HIV 
reverse transcriptase (Wong-Staal et al 9 1985, Nature 373:277-284) are 
known and are incorporated herein by reference in their entirety. 

[0168] Preferably, oligonucleotide directed mutagenesis is used to create the 

mutant reverse transcriptases which allows for all possible classes of base pair 
changes at any determined site along the encoding DNA molecule. 

Enhancing Expression of Reverse Transcriptases 

[0169] To optimize expression of reverse transcriptases of the present 

invention, inducible or constitutive promoters are well known and may be 
used to express high levels of a reverse transcriptase structural gene in a 
recombinant host. Similarly, high copy number vectors, well known in the art, 
may be used to achieve high levels of expression. Vectors having an inducible 
high copy number may also be useful to enhance expression of the reverse 
transcriptases of the invention in a recombinant host. 
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[0170]. To express the desired structural gene in a prokaryotic cell (such as E. 

coll B. subtilis, Pseudomonas, etc.), it is preferable to operably link the 
desired structural gene to a functional prokaryotic promoter. However, the 
natural promoter of the reverse transcriptase gene may function in prokaryotic 
hosts allowing expression of the reverse transcriptase gene. Thus, the natural 
promoter or other promoters may be used to express the reverse transcriptase 
gene. Such other promoters that may be used to enhance expression include 
constitutive or regulatable {i.e., inducible or derepressible) promoters. 
Examples of constitutive promoters include the int promoter of bacteriophage 
X, and the bla promoter of the P-lactamase gene of pBR322. Examples of 
inducible prokaryotic promoters include the major right and left promoters of 
bacteriophage X (P R and P L ), trp, recA, lacZ, lacl, tet, gal trc, ara BAD 
(Guzman, et al, 1995, J. Bacteriol. 177(14):4121-4130) and tac promoters of 
E. coli. The B. subtilis promoters include (X-amylase (Ulmanen et al, J. 
Bacteriol 7(52:176-182 (1985)) and Bacillus bacteriophage promoters 
(Gryczan, T., In: The Molecular Biology Of Bacilli Academic Press, New 
York (1982)). Streptomyces promoters are described by Ward et al, Mol 
Gen. Genet. 205:468478 (1986)). Prokaryotic promoters are also reviewed by 
Glick, J. Ind. Microbiol. 7:277-282 (1987); Cenatiempto, Y., Biochimie 
68:505-516 (1986); and Gottesman, Ann. Rev. Genet. 75:415-442 (1984). 
Expression in a prokaryotic cell also requires the presence of a ribosomal 
binding site upstream of the gene-encoding sequence. Such ribosomal binding 
sites are disclosed, for example, by Gold et al, Ann. Rev. Microbiol. 
55:365404(1981). 

[0171] To enhance the expression of polymerases of the invention in a 

eukaryotic cell, well known eukaryotic promoters and hosts may be used. 
Enhanced expression of the polymerases may be accomplished in a 
prokaryotic host. One example of a prokaryotic host suitable for use with the 
present invention is Escherichia coli. 

Isolation and Purification of Reverse Transcriptases 

[0172] The enzyme(s) of the present invention is preferably produced by 

growth in culture of the recombinant host containing and expressing the 
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desired -reverse transcriptase gene. However, the reverse transcriptase of the 
present invention may be isolated from any strain, organism, or tissue which 
produces the reverse transcriptase of the present invention. Fragments of the 
reverse transcriptase are also included in the present invention. Such 
fragments include proteolytic fragments and fragments having reverse 
transcriptase activity. Such fragments may also be produced by cloning and 
expressing portions of the reverse transcriptase gene of interest, creating frame 
shift mutations and/or by adding one or more stop codons in the gene of 
interest for expression of a truncated protein or polypeptide. Preferably, 
polypeptides of the invention may be purified and/or isolated from a cell or 
organism expressing them, which may be a wild type cell or organism or a 
recombinant cell or organism. In some embodiments, such polypeptides may 
be substantially isolated from the cell or organism in which they are 
expressed. 

[0173] Any nutrient that can be assimilated by a host containing the cloned 

reverse transcriptase gene may be added to the culture medium. Optimal 
culture conditions should be selected case by case according to the strain used 
and the composition of the culture medium. Antibiotics may also be added to 
the growth media to insure maintenance of vector DNA containing the desired 
gene to be expressed. Media formulations have been described in DSM or 
ATCC Catalogs and Sambrook et al. y In: Molecular Cloning, a Laboratory 
Manual (2nd ed.) 5 Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
NY (1989). 

[0174] Recombinant host cells producing the reverse transcriptases of this 

invention can be separated from liquid culture, for example, by centrifugation. 
In general, the collected microbial cells are dispersed in a suitable buffer, and 
then broken open by ultrasonic treatment or by other well known procedures 
to allow extraction of the enzymes by the buffer solution. After removal of 
cell debris by ultracentrifugation or centrifugation, the reverse transcriptases 
can be purified by standard protein purification techniques such as extraction, 
precipitation, chromatography, affinity chromatography, electrophoresis or the 
like. Assays to detect the presence of the reverse transcriptase during 
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purification are well known in the art and can be used during conventional 
biochemical purification methods to determine the presence of these enzymes. 

[0175] In some embodiments, reverse transcriptases of the present invention 

may be modified to contain an affinity tag in order to facilitate the purification 
of the reverse transcriptase. Suitable affinity tags are well known to those 
skilled in the art and include, but are not limited to, repeated sequences of 
amino acids such as six histidines, epitopes such as the hemagglutinin epitope 
and the myc epitope, and other amino acid sequences that permit the 
simplified purification of the reverse transcriptase. 

[01 76] The invention further relates to fusion proteins comprising (1) a 

protein, or fragment thereof, having one or more activity associated with a 
reverse transcriptase and (2) a tag (e.g., an affinity tag). In particular 
embodiments, the invention includes a reverse transcriptase (e.g., a 
thermostable reverse transcriptase) described herein having one or more (e.g., 
one, two, three, four, five, six, seven, eight, etc.) tags. These tags may be 
located, for example, (1) at the N-terminus, (2) at the C-terminus, or (3) at 
both the N-terminus and C-terminus of the protein, or a fragment thereof 
having one or more activities associated with a reverse transcriptase. A tag 
may also be located internally (e.g., between regions of amino acid sequence 
derived from a reverse transcriptase and/or attached to an amino acid side 
chain). For example, Ferguson et al, Protein Sci. 7:1636-1638 (1998), 
describe a siderophore receptor, FhuA, from Escherichia coli into which an 
affinity tag (i.e., a hexahistidine sequence) was inserted. This tag was shown 
to function in purification protocols employing metal chelate affinity 
chromatography. Additional fusion proteins with internal tags are described in 
U.S. Patent No. 6,143,524, the entire disclosure of which is incorporated 
herein by reference. 

[0177] Tags used to prepare compositions of the invention may vary in length 

but will typically be from about 5 to about 500, from about 5 to about 100, 
from about 10 to about 100, from about 15 to about 100, from about 20 to 
about 100, from about 25 to about 100, from about 30 to about 100 from about 
35 to about 100, from about 40 to about 100, from about 45 to about 100, from 
about 50 to about 100, from about 55 to about 100, from about 60 to about 
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100, from about 65 to about 100, from about 70 to about 100, from about 75 to 
about 100, from about 80 to about 100, from about 85 to about 100, from 
about 90 to about 100, from about 95 to about 100, from about 5 to about 80, 
from about 10 to about 80, from about 20 to about 80, from about 30 to about 
80, from about 40 to about 80, from about 50 to about 80, from about 60 to 
about 80, from about 70 to about 80, from about 5 to about 60, from about 10 
to about 60, from about 20 to about 60, from about 30 to about 60, from about 
40 to about 60, from about 50 to about 60, from about 5 to about 40, from 
about 10 to about 40, from about 20 to about 40, from about 30 to about 40, 
from about 5 to about 30, from about 10 to about 30, from about 20 to about 
30, from about 5 to about 25, from about 10 to about 25, or from about 15 to 
about 25 amino acid residues in length. 
[0178] Tags used to prepare compositions of the invention include those 

which contribute to the thermostability of the fusion protein. Thus, these tags 
may be at least partly responsible, for example, for a particular protein (e.g., a 
protein having one or more activities of a reverse transcriptase activity) having 
increased thermostability. Examples of tags that enhance the thermostability 
of a reverse transcriptase (i.e., M-MLV reverse transcriptase) include, but are 
not limited to, tags having the following amino acid sequences: 
MGGSHHHHHHGMASMTGGQQMGRDLYDDDDKH, which corresponds 
to amino acids 1-32 of the sequence set forth in SEQ ID NO:2 and Table 3, 
and MASGTGGQQMGRDLYDDDDKH, (SEQ ID NO:3). Fragments of 
these tags may also be used in accordance with the invention (preferably those 
having 3 or more, 5 or more, 10 or more, or 15 or more amino acids) Thus, the 
invention includes, in part, reverse transcriptases, or fragments thereof that 
comprise tags and demonstrate enhanced thermostability. Using well known 
methods, one of skill in the art can attach one or more of above-mentioned 
tags to one or more RT enzymes, or fragments thereof having reverse 
transcriptase activity, to produce a thermostable RT enzyme or fragment 
thereof. Suitable RT enzymes include, but are not limited to, retroviral reverse 
transcriptase, retrotransposon reverse transcriptase, hepatitis B reverse 
transcriptase, cauliflower mosaic virus reverse transcriptase, bacterial reverse 
transcriptase, Tth DNA polymerase, Tag DNA polymerase (Saiki, R.K., et al. 9 
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Science 239:487-491 (1988,); U.S. Patent Nos. 4,889,818 and 4,965,188), Tne 
DNA polymerase (PCT Publication No. WO 96/10640), Tma DNA 
polymerase (U.S. Patent No. 5,374,553) and mutants, fragments, variants or 
derivatives thereof {see, e.g., commonly owned U.S. Patent Nos. 5,948,614 
and 6,015,668, which are incorporated by reference herein in their entireties). 
Reverse transcriptases for use in the invention also include retroviral reverse 
transcriptases such as M-MLV reverse transcriptase, AMV reverse 
transcriptase, RSV reverse transcriptase, RAV reverse transcriptase, MAV 
reverse transcriptase, and generally ASLV reverse transcriptases. 
[0179] Tags used in the practice of the invention may serve any number of 

purposes and a number of tags may be added to impart one or more different 
* functions to the reverse transcriptase of the invention.. For example, tags may 
(1) contribute to protein-protein interactions both internally within a protein 
and with other protein molecules, (2) make the protein amenable to particular 
purification methods, (3) enable one to identify whether the protein is present 
in a composition; or (4) give the protein other functional characteristics. 
[0180] Examples of tags which may be used in the practice of the invention 

include metal binding domains (e.g., a poly-histidine segments such as a three, 
four, five, six, or seven histidine region), immunoglobulin binding domains 
(e.g., (1) Protein A; (2) Protein G; (3) T cell, B cell, and/or Fc receptors; 
and/or (4) complement protein antibody-binding domain); sugar binding 
domains (e.g., a maltose binding domain, chitin-binding domain); and 
detectable domains (e.g., at least a portion of beta-galactosidase). Fusion 
proteins may contain one or more tags such as those described above. 
Typically, fusion proteins that contain more than one tag will contain these 
tags at one terminus or both termini (i.e., the N-terminus and the C-terminus) 
of the reverse transcriptase, although one or more tags may be located 
internally instead of or in addition to those present at termini. Further, more 
than one tag may be present at one terminus, internally and/or at both termini 
of the reverse transcriptase. For example, three consecutive tags could be 
linked end-to-end at the N-terminus of the reverse transcriptase. The 
invention further include compositions and reaction mixture which contain the 
above fusion proteins, as well as methods for preparing these fusion proteins, 
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nucleic acid molecules {e.g., vectors) which encode these fusion proteins and 
recombinant host cells which contain these nucleic acid molecules. The 
invention also includes methods for using these fusion proteins as described 
elsewhere herein {e.g., methods for reverse transcribing nucleic acid 
molecules). 

[0181] Tags which enable one to identify whether the fusion protein is present 

in a composition include, for example, tags which can be used to identify the 
protein in an electrophoretic gel. A number of such tags are known in the art 
and include epitopes and antibody binding domains which can be used for 
Western blots. 

[0182] The amino acid composition of the tags for use in the present invention 

may vary. "In some embodiments, a tag may contain from about 1% to about 
5% amino acids that have a positive charge at physiological pH, e.g., lysine, 
arginine, and histidine, or from about 5% to about 10% amino acids that have 
a positive charge at physiological pH, or from about 10% to about 20% amino 
acids that have a positive charge at physiological pH, or from about 10% to 
about 30% amino acids that have a positive charge at physiological pH, or 
from about 10% to about 50% amino acids that have a positive charge at 
physiological pH, or from about 10% to about 75% amino acids that have a 
positive charge at physiological pH. In some embodiments, a tag may contain 
from about 1% to about 5% amino acids that have a negative charge at 
physiological pH, e.g., aspartic acid and glutamic acid, or from about 5% to 
about 10%> amino acids that have a negative charge at physiological pH, or 
from about 10% to about 20% amino acids that have a negative charge at 
physiological pH, or from about 10% to about 30% amino acids that have a 
negative charge at physiological pH, or from about 10% to about 50% amino 
acids that have a negative charge at physiological pH, or from about 10% to 
about 75% amino acids that have a negative charge at physiological pH. In 
some embodiments, a tag may comprise a sequence of amino acids that 
contains two or more contiguous charged amino acids that may be the same or 
different and may be of the same or different charge. For example, a tag may 
contain a series {e.g., two, three, four, five, six, ten etc.) of positively charged 
amino acids that may be the same or different. A tag may contain a series 



-76- 

(e.g., two, three, four, five, six, ten etc.) of negatively charged amino acids that 
may be the same or different. In some embodiments, a tag may contain a 
series (e.g., two, three, four, five, six, ten etc.) of alternating positively 
charged and negatively charged amino acids that may be the same or different 
(e.g., positive, negative, positive, negative, etc.). Any of the above-described 
series of amino acids (e.g., positively charged, negatively charged or 
alternating charge) may comprise one or more neutral polar or non-polar 
amino acids (e.g., two, three, four, five, six, ten etc.) spaced between the 
charged amino acids. Such neutral amino acids may be evenly distributed 
through out the series of charged amino acids (e.g., charged, neutral, charged, 
neutral) or may be unevenly distributed throughout the series (e.g., charged, a 
plurality of neutral, charged, neutral, a plurality of charged, etc.). In some 
embodiments, tags to be attached to the polypeptides of the invention may 
have an overall charge at physiological pH (e.g., positive charge or negative 
charge). The size of the overall charge may vary, for example, the tag may 
contain a net plus one, two, three, four, five, etc. or may possess a net negative 
one, two, three, four, five, etc. The present invention also relates to reverse 
transcriptases (e.g., thermostable reverse transcriptases) comprising one or 
more of the above-described tag sequences, vectors encoding such reverse 
transcriptases, host cells reaction mixture, compositions and reaction mixtures 
comprising such reverse transcriptases, as well as kits comprising containers 
containing such reverse transcriptases. 

In some embodiments, it may be desirable to remove all or a portion of 
a tag sequence from a fusion protein comprising a tag sequence and a 
sequence having reverse transcriptase (RT) activity. In embodiments of this 
type, one or more amino acids forming a cleavage site, e.g., for a protease 
enzyme, may be incorporated into the primary sequence of the fusion protein. 
The cleavage site may be located such that cleavage at the site may remove all 
or a portion of the tag sequence from the fusion protein. In some 
embodiments, the cleavage site may be located between the tag sequence and 
the sequence having RT activity such that all of the tag sequence is removed 
by cleavage with a protease enzyme that recognizes the cleavage site. 
Examples of suitable cleavage sites include, but are not limited to, the Factor 
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Xa cleavage site having the sequence Ile-Glu-Gly-Arg (SEQ ID NO:4), which 
is recognized and cleaved by blood coagulation factor Xa, and the thrombin 
cleavage site having the sequence Leu-Val-Pro-Arg (SEQ ID NO: 5), which is 
recognized and cleaved by thrombin. Other suitable cleavage sites are known 
to those skilled in the art and may be used in conjunction with the present 
invention. 

[0184] The reverse transcriptases of the invention preferably have specific 

activities (e.g., RNA-directed DNA polymerase activity and/or RNase H 
activity) greater than about 5 units/mg, more preferably greater than about 50 
units/mg, still more preferably greater than about 100 units/mg 5 250 units/mg, 
500 units/mg, 1000 units/mg, 5000 units/mg or 10,000 units/mg, and most 
preferably greater than about 15,000 units/mg, greater than about 16,000 
units/mg, greater than about 17,000 units/mg, greater than about 18,000 
units/mg, greater than about 19,000 units/mg and greater than about 20,000 
units/mg. In some embodiments, the reverse transcriptases of the present 
invention may have specific activities greater than about 50,000 units/mg, 
greater than about 100,000 units/mg, greater than about 150,000 units/mg, 
greater than about 200,000 units/mg, greater than about 250,000 units/mg and 
greater than about 300,000 units/mg. Preferred ranges of specific activities for 
the reverse transcriptases of the invention include a specific activity from 
about 5 units/mg to about 750,000 units/mg, a specific activity from about 5 
units/mg to about 500,000 units/mg, from about 5 units/mg to about 300,000 
units/mg, a specific activity of from about 50 units/mg to about 750,000 
units/mg, a specific activity from about 100 units/mg to about 750,000 
units/mg, a specific activity from about 250 units/mg to about 750,000 
units/mg, a specific activity from about 500 units/mg to about 750,000 
units/mg, a specific activity from about 1000 units/mg to about 750,000 
units/mg, a specific activity from about 5000 units/mg to about 750,000 
units/mg, a specific activity from about 10,000 units/mg to about 750,000 
units/mg, a specific activity from about 25,000 units/mg to about 750,000 
units/mg, a specific activity from about 100 units/mg to about 500 units/mg, a 
specific activity from about 100 units/mg to about 400 units/mg, and a specific 
activity from about 200 units/mg to about 500 units/mg. Other preferred 
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ranges of specific activities include a specific activity of from about 200,000 
units/mg to about 350,000 units/mg, a specific activity from about 225,000 
units/mg to about 300,000 units/mg, a specific activity from about 250,000 
units/mg to about 300,000 units/mg, a specific activity of from about 200,000 
units/mg to about 750,000 units/mg, a specific activity of from about 200,000 
units/mg to about 500,000 units/mg, a specific activity of from about 200,000 
units/mg to about 400,000 units/mg, a specific activity of from about 250,000 
units/mg to about 750,000 units/mg, a specific activity of from about 250,000 
units/mg to about 500,000 units/mg, and a specific activity of from about 
250,000 units/mg to about 400,000 units/mg. Preferably, the lower end of the 
specific activity range may vary from 50, 100, 200, 300, 400, 500, 700, 900, 
1,000, 5,000, 10,000, 20,000, 30,000, 35,000, 40,000, 45,000, 50,000, 55,000, 
60,000, 65,000, 70,000, 75,000, and 80,000 units/mg, while the upper end of 
the range may vary from 750,000, 650,000, 600,000, 550,000, 500,000, 
450,000, 400,000, 350,000, 300,000, 250,000, 200,000, 150,000, 140,000, 
130,000, 120,000, 110,000, 100,000, and 9.0,000 units/mg. Specific activity 
may be determined using enzyme assays and protein assays well known in the 
art. DNA polymerase assays and RNase H assays are described, for example, 
in U.S. patent no. 5,244,797 and WO 98/47912. In some embodiments of the 
present invention, the specific activity of the thermostable reverse 
transcriptase prepared in accordance with the present invention may be higher 
than the specific activity of a non-thermostable reverse transcriptase. In some 
embodiments, the specific activity of the thermostable reverse transcriptase 
may be 5%, 10,%, 15%, 20%, 25%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 
100% or more higher than the specific activity of a corresponding non- 
thermostable reverse transcriptase. In some preferred embodiments, the 
specific activity of the thermostable reverse transcriptase according to the 
present invention may be 30% or more higher than the specific activity of a 
corresponding non-thermostable reverse transcriptase. In accordance with the 
invention, specific activity is a measurement of the enzymatic activity (in 
units) of the protein or enzyme relative to the total amount of protein or 
enzyme used in a reaction. The measurement of a specific activity may be 
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determined by standard techniques well-known to one of ordinary skill in the 
art. 

[0185] The reverse transcriptases of the invention may be used to make 

nucleic acid molecules from one or more templates. Such methods comprise 
mixing one or more nucleic acid templates {e.g., mRNA, and more preferably 
a population of mRNA molecules) with one or more of the reverse 
transcriptases of the invention and incubating the mixture under conditions 
sufficient to make one or more nucleic acid molecules complementary to all or 
a portion of the one or more nucleic acid templates. 

[0186] The invention also relates to methods for the amplification of one or 

more nucleic acid molecules comprising mixing one or more nucleic acid 
templates with one of the reverse transcriptases of the invention, and 
incubating the mixture under conditions sufficient to amplify one or more 
nucleic acid molecules complementary to all or a portion of the one or more 
nucleic acid templates. Such amplification methods may further comprise the 
use of one or more DNA polymerases and may be employed as in standard 
RT-PCR reactions. 

[0187] The invention also concerns methods for the sequencing of one or 

more nucleic acid molecules comprising (a) mixing one or more nucleic acid 
molecules to be sequenced with one or more primer nucleic acid molecules, 
one or more reverse transcriptases of the invention, one or more nucleotides 
and one or more terminating agents; (b) incubating the mixture under 
conditions sufficient to synthesize a population of nucleic acid molecules 
complementary to all or a portion of the one or more nucleic acid molecules to 
be sequenced; and (c) separating the population of nucleic acid molecules to 
determine the nucleotide sequence of all or a portion of the one or more 
nucleic acid molecules to be sequenced. 

[0188] The invention also concerns nucleic acid molecules produced by such 

methods (which may be full-length cDNA molecules), vectors (particularly 
expression vectors) comprising these nucleic acid molecules and host cells 
comprising these vectors and nucleic acid molecules. 
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Sources of DNA Polymerase 

[0189] A variety of DNA polymerases are useful in accordance with the 

present invention. Such polymerases include, but are not limited to, Thermus 
thermophilus (Tth) DNA polymerase, Thermus aquaticus (Tag) DNA 
polymerase, Thermotoga neapolitana (Tne) DNA polymerase, Thermotoga 
maritima (Tma) DNA polymerase, Thermococcus litoralis (Tli or VENT™) 
DNA polymerase, Thermococcus kodakaraensis KOD1 DNA Polymerase, 
Pyrococcus furiosis (Pfu) DNA polymerase, Pyrococcus species GB-D 
(DEEPVENT™) DNA polymerase, Pyrococcus woosii (Pwo) DNA 
polymerase, Bacillus sterothermophilus (Bst) DNA polymerase, Bacillus 
caldophilus {Bed) DNA polymerase, Sulfolobus acidocaldarius (Sac) DNA 
polymerase, Thermoplasma acidophilum (Tac) DNA polymerase, Thermus 
flavus (Tfl/Tub) DNA polymerase, Thermus ruber (Tru) DNA polymerase, 
Thermus brockianus (DYNAZYME™) DNA polymerase, Methanobacterium 
thermoautotrophicum (Mth) DNA polymerase, Mycobacterium spp. DNA 
polymerase (Mtb, Mlep), and mutants, variants and derivatives thereof. 
[0190] DNA polymerases used in accordance with the invention may be any 

enzyme that can synthesize a DNA molecule from a nucleic acid template, 
typically in the 5' to 3' direction. Such polymerases may be mesophilic or 
thermophilic, but are preferably thermophilic. Mesophilic polymerases 
include T5 DNA polymerase, T7 DNA polymerase, Klenow fragment DNA 
polymerase, DNA polymerase III, and the like. Preferred DNA polymerases 
are thermostable DNA polymerases such as Taq, Tne, Tma, Pfu, VENT™, 
DEEPVENT™, Tth and mutants, variants and derivatives thereof (U.S. Patent 
No. 5,436,149; U.S. Patent No. 5,512,462; PCT Publication No. WO 
92/06188; PCT Publication No. WO 92/06200; PCT Publication No. WO 
96/10640; Barnes, W.M., Gene 1 72:29-35 (1992); Lawyer, F.C., et aL, PCR 
Meth. Appl 2:275-287 (1993); Flaman, J.-M., et al., Nucl Acids Res, 
22(1 5) :3259 -3260 (1994)). For amplification of long nucleic acid molecules 
(e.g., nucleic acid molecules longer than about 3-5 Kb in length), at least two 
DNA polymerases (one substantially lacking 3' exonuclease activity and the 
other having 3' exonuclease activity) are typically used. See U.S. Patent No. 



-81 - 

5,436,149; U.S. Patent No. 5,512,462; Barnes, W.M., Gene 772:29-35 (1992); 
PCT Publication No. WO 98/06736; and commonly owned, co-pending U.S. 
Patent Application No. 08/801,720, filed February 14, 1997, the disclosures of 
all of which are incorporated herein in their entireties. Examples of DNA 
polymerases substantially lacking in 3' exonuclease activity include, but are 
not limited to, Taq, 7>ze(exo~), Tma, P/u(exo"), Pwo and Tth DNA 
polymerases, and mutants, variants and derivatives thereof. Nonlimiting 
examples of DNA polymerases having 3 f exonuclease activity include Pfu, 
DEEPVENT™ and 77//VENT™ and mutants, variants and derivatives thereof 

Formulation of Compositions and Reaction Mixtures 

To form the compositions of the present invention, one or more reverse 
transcriptases are preferably admixed in a buffered salt solution. One or more 
DNA polymerases and/or one or more nucleotides, and/or one or more primers 
may optionally be added to make the compositions of the invention. More 
preferably, the enzymes are provided at working concentrations in stable 
buffered salt solutions. The terms "stable" and "stability" as used herein 
generally mean the retention by a composition, such as an enzyme 
composition, of at least 70%, preferably at least 80%, and most preferably at 
least 90%, of the original enzymatic activity (in units) after the enzyme or 
composition containing the enzyme has been stored for about one week at a 
temperature of about 4°C, about two to six months at a temperature of about 
-20°C, and about six months or longer at a temperature of about -80°C. As 
used herein, the term "working concentration" means the concentration of an 
enzyme that is at or near the optimal concentration used in a solution to 
perform a particular function (such as reverse transcription of nucleic acids). 

The water used in forming the compositions of the present invention is 
preferably distilled, deionized and sterile filtered (through a 0.1-0.2 
micrometer filter), and is free of contamination by DNase and RNase 
enzymes. Such water is available commercially, for example from Sigma 
Chemical Company (Saint Louis, Missouri), or may be made as needed 
according to methods well known to those skilled in the art. 
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[0193] In addition to the enzyme components, the present compositions 

preferably comprise one or more buffers and cofactors necessary for synthesis 
of a nucleic acid molecule such as a cDNA molecule. Particularly preferred 
buffers for use in forming the present compositions are the acetate, sulfate, 
hydrochloride, phosphate or free acid forms of Tris- 
(hydroxymethyl)aminomethane (TRIS®), although alternative buffers of the 
same approximate ionic strength and pKa as TRIS® may be used with 
equivalent results. In addition to the buffer salts, cofactor salts such as those 
of potassium (preferably potassium chloride or potassium acetate) and 
magnesium (preferably magnesium chloride or magnesium acetate) are 
included in the compositions. Addition of one or more carbohydrates and/or 
sugars to the compositions and/or synthesis reaction mixtures may also be 
advantageous, to support enhanced stability of the compositions and/or 
reaction mixtures upon storage. Preferred such carbohydrates or sugars for 
inclusion in the compositions and/or synthesis reaction mixtures of the 
invention include, but are not limited to, sucrose, trehalose, glycerol, and the 
like. Furthermore, such carbohydrates and/or sugars may be added to the 
storage buffers for the enzymes used in the production of the enzyme 
compositions and kits of the invention. Such carbohydrates and/or sugars are 
commercially available from a number of sources, including Sigma (St. Louis, 
MO). 

[0194] It is often preferable to first dissolve the buffer salts, cofactor salts and 

carbohydrates or sugars at working concentrations in water and to adjust the 
pH of the solution prior to addition of the enzymes. In this way, the pH- 
sensitive enzymes will be less subject to acid- or alkaline-mediated 
inactivation during formulation of the present compositions. 

[0195] To formulate the buffered salts solution, a buffer salt which is 

preferably a salt of Tris(hydroxymethyl)aminomethane (TRIS®), and most 
preferably the hydrochloride salt thereof, is combined with a sufficient 
quantity of water to yield a solution having a TRIS® concentration of 5-150 
millimolar, preferably 10-60 millimolar, and most preferably about 20-60 
millimolar. To this solution, a salt of magnesium (preferably either the 
chloride or acetate salt thereof) or other divalent cation, may be added to 
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provide a working concentration thereof of 1-10 millimolar, preferably 1.5-8.0 
millimolar, and most preferably about 3-7.5 millimolar. A salt of potassium 
(preferably a chloride or acetate salt of potassium), or other monovalent cation 
(e.g., Na) 5 may also be added to the solution, at a working concentration of 10- 
100 millimolar and most preferably about 75 millimolar. A reducing agent, 
such as dithiothreitol, may be added to the solution, preferably at a final 
concentration of about 1-100 mM, more preferably a concentration of about 5- 
50 mM or about 7.5-20 mM, and most preferably at a concentration of about 
1 0 mM. Preferred concentrations of carbohydrates and/or sugars for inclusion 
in the compositions of the invention range from about 5% (w/v) to about 30% 
(w/v), from about 7.5% (w/v) to about 25% (w/v), from about 10% (w/v) to 
about 25% (w/v), from about 10% (w/v) to about 20% (w/v), and preferably 
from about 10% (w/v) to about 15% (w/v). A small amount of a salt of 
ethylenediaminetetraacetate (EDTA), such as disodium EDTA, may also be 
added (preferably about 0.1 millimolar), although inclusion of EDTA does not 
appear to be essential to the function or stability of the compositions of the 
present invention. After addition of all buffers and salts, this buffered salt 
solution is mixed well until all salts are dissolved, and the pH is adjusted using 
methods known in the art to a pH value of from about 7.4 to about 9.2, 
preferably from about 8.0 to about 9.0, and most preferably to about 8.4. 
[0196] To these buffered salt solutions, the enzymes (reverse transcriptases 

and/or DNA polymerases) are added to produce the compositions of the 
present invention. M-MLV reverse transcriptases are preferably added at a 
working concentration in the solution of from about 1,000 to about 50,000 
units per milliliter, from about 2,000 to about 30,000 units per milliliter, from 
about 2,500 to about 25,000 units per milliliter, from about 3,000 to about 
22,500 units per milliliter, from about 4,000 to about 20,000 units per 
milliliter, and most preferably at a working concentration of from about 5,000 
to about 20,000 units per milliliter. In some embodiments, a reverse 
transcriptases of the invention (e.g., an M-MLV reverse transcriptase) may be 
added at a working concentration described above in a first strand reaction 
mixture (e.g., a reaction to reverse transcribe an mRNA molecule) and/or in a 
couple RT/PCR. A suitable concentration of a reverse transcriptase of the 
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invention for these reactions may be from about 5,000 units/ml to about 
50,000 units/ml, from about 5,000 units/ml to about 40,000 units/ml, from 
about 5,000 units/ml to about 30,000 units/ml, or from about 5,000 units/ml to 
about 20,000 units/ml of reverse transcriptase. A reaction may be conducted 
in a volume of 20 yi\ to 50 (il and such a reaction may contain 50 units, 100, 
units, 200 units, 300 units, 400 units or more of a reverse transcriptase of the 
invention. Those skilled in the art will appreciate that adding additional 
reverse transcriptase may allow increased synthesis of the first strand (e.g., the 
DNA strand complementary to the mRNA strand) at the expense of increased 
enzyme usage. The skilled artisan can determine without undue 
experimentation the amount of a reverse transcriptase of the invention to add 
to a reaction in order to produce a desired amount of product at an acceptable 
expense. 

[0197] AMV reverse transcriptases, RSV reverse transcriptases and HIV 

reverse transcriptases, including those of the invention described above, are 
preferably added at a working concentration in the solution of from about 100 
to about 5000 units per milliliter, from about 125 to about 4000 units per 
milliliter, from about 150 to about 3000 units per milliliter, from about 200 to 
about 2500 units per milliliter, from about 225 to about 2000 units per 
milliliter, and most preferably at a working concentration of from about 250 to 
about 1000 units per milliliter. The enzymes in the thermophilic DNA 
polymerase group (Taq, Tne, Tma, Pfu, VENT, DEEPVENT, Tth and mutants, 
variants and derivatives thereof) are preferably added at a working 
concentration in the solution of from about 100 to about 1000 units per 
milliliter, from about 125 to about 750 units per milliliter, from about 150 to 
about 700 units per milliliter, from about 200 to about 650 units per milliliter, 
from about 225 to about 550 units per milliliter, and most preferably at a 
working concentration of from about 250 to about 500 units per milliliter. The 
enzymes may be added to the solution in any order, or may be added 
simultaneously. 

[0198] The compositions of the invention may further comprise one or more 

nucleotides, which are preferably deoxynucleoside triphosphates (dNTPs) or 
dideoxynucleoside triphosphates (ddNTPs). The dNTP components of the 
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present compositions serve as the "building blocks" for newly synthesized 
nucleic acids, being incorporated therein by the action of the polymerases, and 
the ddNTPs may be used in sequencing methods according to the invention. 
Examples of nucleotides suitable for use in the present compositions include, 
but are not limited to, dUTP, dATP, dTTP, dCTP, dGTP, dITP, 7-deaza- 
dGTP, (X-thio-dATP, (X-thio-dTTP, (X-thio-dGTP, <x-thio-dCTP, ddUTP, 
ddATP, ddTTP, ddCTP, ddGTP, ddlTP, 7-deaza-ddGTP, (X-thio-ddATP, (X- 
thio-ddTTP, (X-thio-ddGTP, (X-thio-ddCTP or derivatives thereof, all of which 
are available commercially from sources including Invitrogen Corporation 
(Carlsbad, CA), New England BioLabs (Beverly, Massachusetts) and Sigma 
Chemical Company (Saint Louis, Missouri). The nucleotides may be 
unlabeled, or they may be detectably labeled by coupling them by methods 
known in the art with radioisotopes (e.g., 3 H, 14 C, 32 P or 35 S), vitamins (e.g., 
biotin), fluorescent moieties (e.g., fluorescein, rhodamine, Texas Red, or 
phycoerythrin), chemiluminescent labels (e.g., using the PHOTO-GENE™ or 
ACES™ chemiluminescence systems, available commercially from Invitrogen 
Corporation (Carlsbad, CA)), dioxigenin and the like. Labeled nucleotides 
may also be obtained commercially, for example from Invitrogen Corporation 
(Carlsbad, CA) or Sigma Chemical Company (Saint Louis, Missouri). In the 
present compositions, the nucleotides are added to give a working 
concentration of each nucleotide of about 10-4000 micromolar, about 50-2000 
micromolar, about 100-1500 micromolar, or about 200-1200 micromolar, and 
most preferably a concentration of about 1000 micromolar. 
[0199] To reduce component deterioration, storage of the reagent 

compositions is preferably at about 4°C for up to one day, or most preferably 
at -20°C for up to one year. 
[0200] In another aspect, the compositions and reverse transcriptases of the 

invention may be prepared and stored in dry form in the presence of one or 
more carbohydrates, sugars, or synthetic polymers. Preferred carbohydrates, 
sugars or polymers for the preparation of dried compositions or reverse 
transcriptases include, but are not limited to, sucrose, trehalose, and 
polyvinylpyrrolidone (PVP) or combinations thereof. See, e.g., U.S. Patent 
Nos. 5,098,893, 4,891,319, and 5,556,771, the disclosures of which are 
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entirely incorporated herein by reference. Such dried compositions and 
enzymes may be stored at various temperatures for extended times without 
significant deterioration of enzymes or components of the compositions of the 
invention. Preferably, the dried reverse transcriptases or compositions are 
stored at 4°C or at -20°C. 

[0201] The invention further includes reaction solutions for reverse 

transcribing nucleic acid molecules, as well as reverse transcription methods 
employing such reaction solutions and product nucleic acid molecules 
produced using such methods. In many instances, reaction solutions of the 
invention will contain one or more of the following components: (1) one or 
more buffering agent (e.g., sodium phosphate, sodium acetate, 2-(N- 
moropholino)-ethanesulfonic acid (MES), tris-(hydroxymethyl)aminomethane 
(Tris), 3-(cyclohexylamino)-2-hydroxy- 1 -propanesulfonic acid (CAPS), 
citrate, N-2-hydroxyethylpiperazine-N f -2-ethanesulfonic acid (HEPES), 
acetate, 3-(N-morpholino)prpoanesulfonic acid (MOPS), N- 
tris(hydroxymethyl)methyl-3-aminopropanesulfonio acid (TAPS), etc.), (2) 
one or more monovalent cationic salt (e.g., NaCl, KC1, etc.), (3) one or more 
divalent cationic salt (e.g., MnCl2, MgCl2, MgSC>4, CaCl2, etc.), (4) one or 
more reducing agent (e.g., dithiothreitol, P-mercaptoethanol, etc.), (5) one or 
more ioninc or non-ionic detergent (e.g., Triton X-100™, Nonidet P40™, 
sodium dodecyl sulphate, etc.), (6) one or more DNA polymerase inhibitor 
(e.g., Actinomycin D, etc.), (7) nucleotides (e.g., dNTPs, such as dGTP, 
dATP, dCTP, dTTP, etc.), (8) RNA to be reverse transcribed and/or amplified, 
(9) one or more RNase inhibitor (e.g., RNaseOut™, Invitrogen Corporation, 
Carlsbad, CA, catalog number 10777-019 etc.), (10) a reverse transcriptase 
(e.g., a reverse transcriptase of the invention, and/or (11) one or more diluent 
(e.g., water). Other components and/or constituents (e.g., primers, DNA 
polymerases, etc.) may also be present in reaction solutions. 

[0202] The concentration of the buffering agent in the reaction solutions of the 

invention will vary with the particular buffering agent used. Typically, the 
working concentration (i.e., the concentration in the reaction mixture) of the 
buffering agent will be from about 5 mM to about 500 mM (e.g., about 10 
mM, about 15 mM, about 20 mM, about 25 mM, about 30 mM, about 35 mM, 
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about 40 mM, about 45 mM, about 50 mM, about 55 mM, about 60 mM, 
about 65 mM, about 70 mM, about 75 mM, about 80 mM, about 85 mM, 
about 90 mM, about 95 mM, about 100 mM, from about 5 mM to about 500 
mM, from about 10 mM to about 500 mM, from about 20 mM to about 500 
mM, from about 25 mM to about 500 mM, from about 30 mM to about 500 
mM, from about 40 mM to about 500 mM, from about 50 mM to about 500 
mM, from about 75 mM to about 500 mM, from about 100 mM to about 500 
mM, from about 25 mM to about 50 mM, from about 25 mM to about 75 mM, 
from about 25 mM to about 100 mM, from about 25 mM to about 200 mM, 
from about 25 mM to about 300 mM, etc.). When Tris (e.g., Tris-HCl) is 
used, the Tris working concentration will typically be from about 5 mM to 
about 100 mM, from about 5 mM to about 75 mM, from about lO-mM to 
about 75 mM, from about 10 mM to about 60 mM, from about 10 mM to 
about 50 mM, from about 25 mM to about 50 mM, etc. 
[0203] The final pH of solutions of the invention will generally be set and 

maintained by buffering agents present in reaction solutions of the invention. 
The pH of reaction solutions of the invention, and hence reaction mixtures of 
the invention, will vary with the particular use and the buffering agent present 
but will often be from about pH 5.5 to about pH 9.0 (e.g., about pH 6.0, about 
pH 6.5, about pH 7.0, about pH 7.1, about pH 7.2, about pH 7.3, about pH 7.4, 
about pH 7.5, about pH 7.6, about pH 7.7, about pH 7.8, about pH 7.9, about 
pH 8.0, about pH 8.1, about pH 8.2, about pH 8.3, about pH 8.4, about pH 8.5, 
about pH 8.6, about pH 8.7, about pH 8.8, about pH 8.9, about pH 9.0, from 
about pH 6.0 to about pH 8.5, from about pH 6.5 to about pH 8.5, from about 
pH 7.0 to about pH 8.5, from about pH 7.5 to about pH 8.5, from about pH 6.0 
to about pH 8.0, from about pH 6.0 to about pH 7.7, from about pH 6.0 to 
about pH 7.5, from about pH 6.0 to about pH 7.0, from about pH 7.2 to about 
pH 7.7, from about pH 7.3 to about pH 7.7, from about pH 7.4 to about pH 
7.6, from about pH 7.0 to about pH 7.4, from about pH 7.6 to about pH 8.0, 
from about pH 7.6 to about pH 8.5, from about pH 7.7 to about pH 8.5, from 
about pH 7.9 to about pH 8.5, from about pH 8.0 to about pH 8.5, from about 
pH 8.2 to about pH 8.5, from about pH 8.3 to about pH 8.5, from about pH 8.4 
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to about pH 8.5, from about pH 8.4 to about pH 9.0, from about pH 8.5 to 
about pH 9.0, etc.) 

As indicated, one or more monovalent cationic salts (e.g., NaCl, KC1, 
etc.) may be included in reaction solutions of the invention. In many 
instances, salts used in reaction solutions of the invention will dissociate in 
solution to generate at least one species which is monovalent (e.g., Na+, K+, 
etc.) When included in reaction solutions of the invention, salts will often be 
present either individually or in a combined concentration of from about 0.5 
mM to about 500 mM (e.g., about 1 mM, about 2 mM, about 3 mM, about 5 
mM, about 10 mM, about 12 mM, about 15 mM, about 17 mM, about 20 mM, 
about 22 mM, about 23 mM, about 24 mM, about 25 mM, about 27 mM, 
about 30 mM, about 35 mM, about 40 mM, about 45 mM, about 50 mM, 
about 55 mM, about 60 mM, about 64 mM, about 65 mM, about 70 mM, 
about 75 mM, about 80 mM, about 85 mM, about 90 mM, about 95 mM, 
about 100 mM, about 120 mM, about 140 mM, about 150 mM, about 175 
mM, about 200 mM, about 225 mM, about 250 mM, about 275 mM, about 
300 mM, about 325 mM, about 350 mM, about 375 mM, about 400 mM, from 
about 1 mM to about 500 mM, from about 5 mM to about 500 mM, from 
about 10 mM to about 500 mM, from about 20 mM to about 500 mM, from 
about 30 mM to about 500 mM, from about 40 mM to about 500 mM, from 
about 50 mM to about 500 mM, from about 60 mM to about 500 mM, from 
about 65 mM to about 500 mM, from about 75 mM to about 500 mM, from 
about 85 mM to about 500 mM, from about 90 mM to about 500 mM, from 
about 100 mM to about 500 mM, from about 125 mM to about 500 mM, from 
about 150 mM to about 500 mM, from about 200 mM to about 500 mM, from 
about 10 mM to about 100 mM, from about 10 mM to about 75 mM, from 
about 10 mM to about 50 mM, from about 20 mM to about 200 mM, from 
about 20 mM to about 150 mM, from about 20 mM to about 125 mM, from 
about 20 mM to about 100 mM, from about 20 mM to about 80 mM, from 
about 20 mM to about 75 mM, from about 20 mM to about 60 mM, from 
about 20 mM to about 50 mM, from about 30 mM to about 500 mM, from 
about 30 mM to about 100 mM, from about 30 mM to about 70 mM, from 
about 30 mM to about 50 mM, etc.). 
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[0205] As indicated, one or more divalent cationic salts (e.g., MnCl2, MgCl2, 

MgS04, CaCl2, etc.) may be included in reaction solutions of the invention. 
In many instances, salts used in reaction solutions of the invention will 
dissociate in solution to generate at least one species which is monovalent 
(e.g., Mg ++ , Mn ++ , Ca ++ , etc.) When included in reaction solutions of the 
invention, salts will often be present either individually or in a combined 
concentration of from about 0.5 mM to about 500 mM (e.g., about 1 mM, 
about 2 mM, about 3 mM, about 4 mM, about 5 mM, about 6 mM, about 7 
mM, about 8 mM, about 9 mM, about 10 mM, about 12 mM, about 15 mM, 
about 17 mM, about 20 mM, about 22 mM, about 23 mM, about 24 mM, 
about 25 mM, about 27 mM, about 30 mM, about 35 mM, about 40 mM, 
about 45 mM, about 50 mM, about 55 mM, about 60 mM, about 64 mM, 
about 65 mM, about 70 mM, about 75 mM, about 80 mM, about 85 mM, 
about 90 mM, about 95 mM, about 100 mM, about 120 mM, about 140 mM, 
about 150 mM, about 175 mM, about 200 mM, about 225 mM, about 250 
mM, about 275 mM, about 300 mM, about 325 mM, about 350 mM, about 
375 mM, about 400 mM, from about 1 mM to about 500 mM, from about 5 
mM to about 500 mM, from about 10 mM to about 500 mM, from about 20 
mM to about 500 mM, from about 30 mM to about 500 mM, from about 40 
mM to about 500 mM, from about 50 mM to about 500 mM, from about 60 
mM to about 500 mM, from about 65 mM to about 500 mM, from about 75 
mM to about 500 mM, from about 85 mM to about 500 mM, from about 90 
mM to about 500 mM, from about 100 mM to about 500 mM, from about 125 
mM to about 500 mM, from about 150 mM to about 500 mM, from about 200 
mM to about 500 mM, from about 10 mM to about 100 mM, from about 10 
mM to about 75 mM, from about 10 mM to about 50 mM, from about 20 mM 
to about 200 mM, from about 20 mM to about 150 mM, from about 20 mM to 
about 125 mM, from about 20 mM to about 100 mM, from about 20 mM to 
about 80 mM, from about 20 mM to about 75 mM, from about 20 mM to 
about 60 mM, from about 20 mM to about 50 mM, from about 30 mM to 
about 500 mM, from about 30 mM to about 100 mM, from about 30 mM to 
about 70 mM, from about 30 mM to about 50 mM, etc.). 
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When included in reaction solutions of the invention, reducing- agents 
{e.g., dithiothreitol, (3-mercaptoethanol, etc.) will often be present either 
individually or in a combined concentration of from about 0.1 mM to about 50 
mM (e.g., about 0.2 mM, about 0.3 mM, about 0.5 mM, about 0.7 mM, about 
0.9 mM, about 1 mM, about 2 mM, about 3 mM, about 4 mM, about 5 mM, 
about 6 mM, about 10 mM, about 12 mM, about 15 mM, about 17 mM, about 
20 mM, about 22 mM, about 23 mM, about 24 mM, about 25 mM, about 27 
mM, about 30 mM, about 35 mM, about 40 mM, about 45 mM, about 50 mM, 
from about 0.1 mM to about 50 mM, from about 0.5 mM to about 50 mM, 
from about 1 mM to about 50 mM, from about 2 mM to about 50 mM, from 
about 3 mM to about 50 mM, from about 0.5 mM to about 20 mM, from about 
0.5 mM to about 10 mM, from about 0.5 mM to about 5 mM, from about 0.5 
mM to about 2.5 mM, from about 1 mM to about 20 mM, from about 1 mM to 
about 10 mM, from about 1 mM to about 5 mM, from about 1 mM to about 
3.4 mM, from about 0.5 mM to about 3.0 mM, from about 1 mM to about 3.0 
mM, from about 1.5 mM to about 3.0 mM, from about 2 mM to about 3.0 
mM, from about 0.5 mM to about 2.5 mM, from about 1 mM to about 2.5 
mM, from about 1.5 mM to about 2.5 mM, from about 2 mM to about 3.0 
mM, from about 2.5 mM to about 3.0 mM, from about 0.5 mM to about 2 
mM, from about 0.5 mM to about 1,5 mM, from about 0.5 mM to about 1.1 
mM, from about 5.0 mM to about 10 mM, from about 5.0 mM to about 15 
mM, from about 5.0 mM to about 20 mM, from about 10 mM to about 15 
mM, from about 10 mM to about 20 mM, etc.). 

Reaction solutions of the invention may also contain one or more 
ioninc or non-ionic detergent (e.g., Triton X-100™, NonidetP40™, sodium 
dodecyl sulphate, etc.). When included in reaction solutions of the invention, 
detergents will often be present either individually or in a combined 
concentration of from about 0.01% to about 5.0% (e.g., about 0.01%, about 
0.02%, about 0.03%, about 0.04%, about 0.05%, about 0.06%, about 0.07%, 
about 0.08%, about 0.09%, about 0.1%, about 0.15%, about 0.2%, about 0.3%, 
about 0.5%, about 0.7%, about 0.9%, about 1%, about 2%, about 3%, about 
4%, about 5%, from about 0.01% to about 5.0%, from about 0.01% to about 
4.0%, from about 0.01% to about 3.0%, from about 0.01% to about 2.0%, 
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from about 0.01% to about 1.0%, from about 0.05% to about 5.0%, from about 
0.05% to about 3.0%, from about 0.05% to about 2.0%, from about 0.05% to 
about 1.0%, from about 0.1% to about 5.0%, from about 0.1% to about 4.0%, 
from about 0.1% to about 3.0%, from about 0.1% to about 2.0%, from about 
0.1% to about 1.0%, from about 0.1% to about 0.5%, etc.). For example, 
reaction solutions of the invention may contain Triton X-100™ at a 
concentration of from about 0.01% to about 2.0%, from about 0.03% to about 
1.0%, from about 0.04% to about 1.0%, from about 0.05% to about 0.5%, 
from about 0,04% to about 0.6%, from about 0.04% to about 0.3%, etc. 
[0208] Reaction solutions of the invention may also contain one or more DNA 

polymerase inhibitor (e.g., Actinomycin D, etc.). When included in reaction 
solutions of the invention, such inhibitors will often be present either 
individually or in a combined concentration of from about 0.1 pg/ml to about 
100 pg/ml (e.g., about 0.1 pg/ml, about 0.2 pg/ml, about 0.3 pg/ml, about 0.4 
pg/ml, about 0.5 pg/ml, about 0.6 pg/ml, about 0.7 pg/ml, about 0.8 pg/ml, 
about 0.9 pg/ml, about 1.0 pg/ml, about 1.1 pg/ml, about 1.3 pg/ml, about 1.5 
pg/ml, about 1.7 pg/ml, about 2.0 pg/ml, about 2.5 pg/ml, about 3.5 pg/ml, 
about 5.0 jag/ml, about 7.5 ug/ml, about 10 pg/ml, about 15 pg/ml, about 20 
pg/ml, about 25 pg/ml, about 30 pg/ml, about 35 pg/ml, about 40 pg/ml, 
about 50 pg/ml, about 60 pg/ml, about 70 pg/ml, about 80 pg/ml, about 90 
pg/ml, about 100 pg/ml, from about 0.5 pg/ml to about 30 pg/ml, from about 
0.75 pg/ml to about 30 pg/ml, from about 1.0 pg/ml to about 30 pg/ml, from 
about 2.0 pg/ml to about 30 pg/ml, from about 3.0 pg/ml to about 30 pg/ml, 
from about 4.0 pg/ml to about 30 pg/ml, from about 5.0 pg/ml to about 30 
pg/ml, from about 7.5 pg/ml to about 30 pg/ml, from about 10 pg/ml to about 
30 pg/ml, from about 15 pg/ml to about 30 pg/ml, from about 0.5 pg/ml to 
about 20 pg/ml, from about 0.5 pg/ml to about 10 pg/ml, from about 0.5 
pg/ml to about 5 pg/ml, from about 0.5 pg/ml to about 2 pg/ml, from about 
0.5 pg/ml to about 1 pg/ml, from about 1 pg/ml to about 10 pg/ml, from about 
1 pg/ml to about 5 pg/ml, from about 1 pg/ml to about 2 pg/ml, from about 1 
pg/ml to about 100 pg/ml, from about 10 pg/ml to about 100 pg/ml, from 
about 20 pg/ml to about 100 pg/ml, from about 40 pg/ml to about 100 pg/ml, 
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from about 30 jag/ml to about 80 fxg/ml, from about 30 fag/ml to about 70 
^ig/ml, from about 40 jag/ml to about 60 |wg/ml, from about 40 ^ig/ml to about 
70 jag/ml, from about 40 jug/ml to about 80 |ug/ml, etc.). 
[0209] In many instances, nucleotides (e.g., dNTPs, such as dGTP, dATP, 

dCTP, dTTP, etc.) will be present in reaction mixtures of the invention. 
Typically, individually nucleotides will be present in concentrations of from 
about 0.05 mM to about 50 mM {e.g., about 0.07 mM, about 0.1 mM, about 
0.15 mM, about 0.18 mM, about 0.2 mM, about 0.3 mM, about 0.5 mM, about 
0.7 mM, about 0.9 mM, about 1 mM, about 2 mM, about 3 mM, about 4 mM, 
about 5 mM, about 6 mM, about 10 mM, about 12 mM, about 15 mM, about 
17 mM, about 20 mM, about 22 mM, about 23 mM, about 24 mM, about 25 
mM, about 27 mM, about 30 mM, about 35 mM, about 40 mM, about 45 mM, 
about 50 mM, from about 0.1 mM to about 50 mM, from about 0.5 mM to 
about 50 mM, from about 1 mM to about 50 mM, from about 2 mM to about 
50 mM, from about 3 mM to about 50 mM, from about 0.5 mM to about 20 
mM, from about 0.5 mM to about 10 mM, from about 0.5 mM to about 5 mM, 
from about 0.5 mM to about 2.5 mM, from about 1 mM to about 20 mM, from 
about 1 mM to about 10 mM, from about 1 mM to about 5 mM, from about 1 
mM to about 3.4 mM, from about 0.5 mM to about 3.0 mM, from about 1 mM 
to about 3.0 mM, from about 1.5 mM to about 3.0 mM, from about 2 mM to 
about 3.0 mM, from about 0.5 mM to about 2.5 mM, from about 1 mM to 
about 2.5 mM, from about 1.5 mM to about 2.5 mM, from about 2 mM to 
about 3.0 mM, from about 2.5 mM to about 3.0 mM, from about 0.5 mM to 
about 2 mM, from about 0.5 mM to about 1.5 mM, from about 0.5 mM to 
about 1.1 mM, from about 5.0 mM to about 10 mM, from about 5.0 mM to 
about 15 mM, from about 5.0 mM to about 20 mM, from about 10 mM to 
about 15 mM, from about 10 mM to about 20 mM, etc.). The combined 
nucleotide concentration, when more than one nucleotides is present, can be 
determined by adding the concentrations of the individual nucleotides 
together. When more than one nucleotide is present in reaction solutions of 
the invention, the individual nucleotides may not be present in equimolar 
amounts. Thus, a reaction solution may contain, for example, 1 mM dGTP, 1 
mM dATP, 0.5 mM dCTP, and 1 mM dTTP. 
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[0210] RNA will typically be present in reaction solutions of the invention. In 

most instances, RNA will be added to the reaction solution shortly prior to 
reverse transcription. Thus, reaction solutions may be provided without RNA. 
This will typically be the case when reaction solutions are provided in kits, 
RNA, when present in reaction solutions will often be present in a 
concentration of 1 picogram to 100 (ag/20 jil reaction mixture {e.g., about 1 
picogram/20 jliI, about 10 picograms/20 yl, about 50 picograms/20 \x\, about 
100 picograms/20 jxl, about 200 picograms/20 \xl, about 10 picograms/20 
about 500 picograms/20 pi, about 800 picograms/20 jul, about 1.0 
nanogram/20 jxl, about 5.0 nanograms/20 jil, about 10 nanograms/20 \xl, about 
25 nanograms/20 \d, about 50 nanograms/20 \xl, about 75 nanograms/20 pal, 
about 100 nanograms/20 jal, about 150 nanograms/20 jul, about 250 
nanograms/20 jxl, about 400 nanograms/20 \xl, about 500 nanograms/20 
about 750 nanograms/20 jil, about 1.0 jag/20 jxl, about 5.0 ng/20 jul, about 10 
jag/20 |xl, about 20 jxg/20 )al, about 30 jag/20 (il, about 40 |xg/20 about 50 
Hg/20 |al, about 70 ng/20 about 85 jig/20 yl 9 about 100 jag/20 jxl, from 
about 10 picograms/20 jxl to about 100 |^g/20 \jl 9 from about 10 picograms/20 
III to about 100 jig/20 jul, from about 100 picograms/20 |il to about 100 jug/20 
jul, from about 1.0 nanograms/20 jal to about 100 jag/20 (xl, from about 100 
nanograms/20 |nl to about 100 (ig/20 (il, from about 10 picograms/20 jal to 
about 10 |ig/20 jal, from about 10 picograms/20 \xl to about 5 jag/20 |nl, from 
about 100 nanograms/20 jal to about 5 jag/20 \il 9 from about 1 |ng/20 |nl to 
about 10 |ug/20 jal, from about 1 |ug/20 |al to about 5 |xg/20 jlxI, from about 100 
nanograms/20 |xl to about 1 jag/20 fal, from about 500 nanograms/20 |ul to 
about 5 jig/20 jal, etc.). As one skilled in the art would recognize, different 
reverse transcription reactions may be performed in volumes other than 20 jal. 
In such instances, the total amount of RNA present will vary with the volume 
used. Thus, the above amounts are provided as examples of the amount of 
RNA/20 jal of reaction solution. 
[0211] Reverse transcriptases (e.g., reverse transcriptases of the invention) 

may also be present in reaction solutions. When present, reverse 
transcriptases, will often be present in a concentration which results in about 
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0.01 to about 1,000 units of reverse transcriptase activity/fxl (e.g. , about 0.01 
unit/|al, about 0.05 united, about 0.1 unit/j^l, about 0.2 unit/jxl, about 0.3 
unit/nl, about 0.4 unit/jal, about 0.5 unit/|al, about 0.7 unit/|ul, about 1.0 unit/jal, 
about 1.5 united, about 2.0 unit/jal, about 2.5 unit/^il, about 5.0 unit/^il, about 
7.5 unit/jal, about 10 unit/^1, about 20 unit/|al, about 25 unit/|il, about 50 
unit/^il, about 100 unit/jiil, about 150 unit/|Ltl, about 200 unit/^1, about 250 
unit/jLtl, about 350 unit/|al, about 500 united, about 750 unit/jal, about 1,000 
unit/|ul, from about 0.1 unit/jal to about 1,000 unit/|il, from about 0.2 unit/|xl to 
about 1,000 unit/|Lil, from about 1.0 unit/jal to about 1,000 unit/jal, from about 
5.0 united to about 1,000 unit/jal, from about 10 unit/jal to about 1,000 unit/jxl, 
from about 20 unit/|nl to about 1,000 unit/jal, from about 50 unit/jal to about 
1,000 unit/jixl, from about 100 unit/jil to about 1,000 unit/jal, from about 200 
unit/jal to about 1,000 unit/|il, from about 400 unit/jal to about 1,000 unit/|ul, 
from about 500 unit/jal to about 1,000 unit/jul, from about 0.1 unit/jal to about 
300 unit/|nl, from about 0.1 unit/|al to about 200 unit/jal, from about 0.1 unit/|il 
to about 100 unit/jil, from about 0.1 unit/jal to about 50 unit/j^l, from about 0.1 
unit/jul to about 10 unit/|Lil, from about 0.1 unit/jal to about 5.0 unit/^l, from 
about 0.1 unit/^il to about 1.0 unit/jal, from about 0.2 unit/jal to about 0.5 
unit/jxl, etc. 

[0212] Reaction solutions of the invention may be prepared as concentrated 

solutions (e.g., 5X solutions) which are diluted to a working concentration for 
final use. With respect to a 5X reaction solution, a 5:1 dilution is required to 
bring such a 5X solution to a working concentration. Reaction solutions of the 
invention may be prepared, for examples, as a 2X, a 3X, a 4X, a 5X, a 6X, a 
7X, a 8X, a 9X, a 10X, etc. solutions. One major limitation on the fold 
concentration of such solutions is that, when compounds reach particular 
concentrations in solution, precipitation occurs. Thus, concentrated reaction 
solutions will generally be prepared such that the concentrations of the various 
components are low enough so that precipitation of buffer components will 
not occur. As one skilled in the art would recognize, the upper limit of 
concentration which is feasible for each solution will vary with the particular 
solution and the components present. 
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[0213] In many instances, reaction solutions of the invention wilKbe provided 

in sterile form. Sterilization may be performed on the individual components 
of reaction solutions prior to mixing or on reaction solutions after they are 
prepared. Sterilization of such solutions may be performed by any suitable 
means including autoclaving or ultrafiltration. 

Labeling Nucleic Acids 

[0214] In general, the invention provides, in part, compositions for use in 

reverse transcription of a nucleic acid molecule to produce labeled nucleic 
acid molecules. Such compositions may comprise one or more reverse 
transcriptases (e.g., single subunit and/or multi-subunit RTs). The enzymes in 
these compositions are preferably present in working concentrations and have 
RNase H activity or are reduced or substantially reduced or lacking in RNase 
H activity, although mixtures of enzymes, some having RNase H activity and 
some reduced or substantially reduced or lacking in RNase H activity, may be 
used in the compositions of the invention. Preferred reverse transcriptases 
include M-MLV reverse transcriptase, RSV reverse transcriptase, AMV 
reverse transcriptase, RAV reverse transcriptase, MAV reverse transcriptase 
and HIV reverse transcriptase or other ASLV reverse transcriptases. 
[0215] The invention is also directed to methods for reverse transcription of 

one or more nucleic acid molecules comprising mixing one or more nucleic 
acid templates, which is preferably RNA or messenger RNA (mRNA) and 
more preferably a population of mRNA molecules, with one or more 
polypeptides having reverse transcriptase activity and incubating the mixture 
under conditions sufficient to make one or more labeled nucleic acid 
molecules complementary to all or a portion of the one or more templates. To 
make the nucleic acid molecule or molecules complementary to the one or 
more templates, at least one primer (e.g., an oligo(dT) primer) and one or 
more nucleotides (a portion of which are preferably labeled, most preferably 
fluorescently labeled) are used for nucleic acid synthesis. Nucleic acid 
templates suitable for reverse transcription according to this aspect of the 
invention include any nucleic acid molecule, particularly those derived from a 
prokaryotic or eukaryotic cell. Such cells may include normal cells, diseased 
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cells, transformed cells, established cells, progenitor cells, precursor cells, 
fetal cells, embryonic cells, bacterial cells, yeast cells, animal cells (including 
human cells), avian cells, plant cells and the like, or tissue isolated from a 
plant or an animal {e.g., human, cow, pig, mouse, sheep, horse, monkey, 
canine, feline, rat, rabbit, bird, fish, insect, etc.). Such nucleic acid molecules 
may also be isolated from viruses. In some embodiments, methods of the 
invention result in the direct labeling of a nucleic acid molecule by 
incorporation of a labeled nucleotide (e.g., a nucleotide containing a 
fluorescent label). In other methods, nucleic acid molecules are indirectly 
labeled by first, incorporating a nucleotide analog containing a reactive 
functionality to produce a nucleic acid containing one or more reactive 
functionalities. The nucleic acid containing reactive functionalities may 
subsequently be reacted with a molecule containing a label that reacts with the 
functionality to attach the label to the nucleic acid molecule. The reaction 
may be result in covalent attachment of all or a portion of the label-containing 
molecule to the nucleic acid molecule (e.g., chemical coupling). In some 
embodiments, amine-modified NTPs (e.g., amino allyl-dUTP/UTP) are 
incorporated during reverse transcription. Amino allyl-NTPs are incorporated 
with similar efficiency as unmodified NTPs during enzymatic reactions such 
as reverse transcription. The amine functionality is then coupled with a dye 
using standard techniques. Kits for indirect labeling of cDNA are 
commercially available from, for example, Ambion, Inc., Austin, TX. In some 
embodiments, the label-containing molecule may be non-covalently bound to 
the reactive functionality. For example, the reactive functionality may be a 
biotin moiety and the label-containing molecule may be a labeled (e.g., 
fluorescently labeled) avidin or streptavidin molecule. 
[0216] The invention also provides labeled nucleic acid molecules produced 

according to the above-described methods. Such labeled nucleic acid 
molecules may be single or double stranded and are useful as detection probes. 
Depending on the labeled nucleotide(s) used during synthesis, the labeled 
molecules may contain one or a number of labels. Where multiple labels are 
used, the molecules may comprise a number of the same or different labels. 
Thus, one type or multiple different labeled nucleotides may be used during 
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synthesis of nucleic acid molecules to provide for the labeled nucleic, acid 
molecules of the invention. Such labeled nucleic acid molecules will thus 
comprise one or more (e.g., two, three, four, five, six, seven, eight, nine, ten, 
etc.) labeled nucleotides (which may be the same or different). 
[0217] Labeled nucleic acid molecules produced by methods of the invention 

may either (1) comprise a particular numbers of labeled nucleotides or (2) a 
particular percentage of the nucleotides present in the nucleic acid molecule 
may be labeled. In either instance, the concentration of labeled nucleotides 
present in the reaction mixture may be adjusted, with respect to un-labeled 
nucleotides, such that product nucleic acid molecules are produced which 
contain (1) a particular number of labeled nucleotides or (2) a particular 
percentage of label nucleotides as compared to un-labeled nucleotides. In 
particular instances, from about 0.1% to about 20%, from about 0.1% to about 
15%, from about 0.1% to about 10%, from about 0.1% to about 5.0%, from 
about 0.1% to about 2.5%, from about 0.1% to about 1.5%, from about 0.1% 
to about 1.0%, from about 0.1% to about 0.5%, from about 2.0% to about 
20%, from about 4.0% to about 20%, from about 0.5% to about 10%, from 
about 0.5% to about 5%, from about 0.5% to about 2.0%, or from about 0.5% 
to about 1.0% of the total nucleotides present in product nucleic acid 
molecules are labeled. As one skilled in the art would recognize, the actual 
number of labeled nucleotides, and thus the percentage of nucleotides which 
are labeled, present in individual molecules of a population will typically 
differ. In other words, different members of populations of nucleic acid 
molecules will typically contain different numbers of labeled nucleotides with 
the overall average of labeled nucleotides present in each product molecule 
varying with a number of factors (e.g., the ratio of labeled to un-labeled 
nucleotides present in the reaction mixture). In most instances, at least 85%, 
at least 90%, at least 95%, or at least 99% of the individual product nucleic 
acid molecules in the population will contain labeled nucleotides which fall 
within a range set out above. 
[0218] In accordance with the invention, the amount of labeled product is 

preferably measured based on percent incorporation of the label of interest 
into synthesized product as may be determined by one skilled in the art, 
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although other means of measuring the amount or efficiency of labeling of 
product will be recognized by one of ordinary skill in the art. The invention 
provides for enhanced or increased percent incorporation of labeled nucleotide 
during synthesis of a nucleic acid molecule from a template, preferably during 
synthesis of one or more cDNA molecules from RNA. According to the 
invention, such enhancement or increase in percent incorporation is preferably 
about equal to or greater than a 2-fold, a 5-fold, a 10-fold, a 15-fold, a 20-fold, 
a 25-fold, a 30-fold, a 40-fold or a 50-fold increase or enhancement in percent 
incorporation compared to a standard reverse transcriptase. 
[0219] The invention also provides kits for use in accordance with the 

invention. Such kits comprise a carrier means, such as a box or carton, having 
in close confinement therein one or more container means, such as vials, 
tubes, bottles and the like, wherein the kit comprises, in the same or different 
containers, one or more reverse transcriptases. The kits of the invention may 
also comprise, in the same or different containers, one or more DNA 
polymerases, one or more primers, one or more suitable buffers and/or one or 
more nucleotides (such as deoxynucleoside triphosphates (dNTPs) and 
preferably labeled dNTPs (e.g., fluorescently labeled dNTPs)). 
[0220] In some embodiments, the RTs used in the invention comprise two or 

more subunits (or derivatives, variants, fragments or mutants thereof) and 
preferably comprise two subunits (e.g., a dimer or heterodimer). Two subunit 
reverse transcriptases typically have an a and a p subunit forming a dimer, 
although any form or combination of subunits (and derivatives, variants or 
mutants of such subunits) may be used. Such combinations may include ocP, 
PP, aa and the like. Preferred two subunit RTs for use in the invention 
include RSV RT, AMV RT, AEV RT, RAV RT, HIV RT and MAV RT, or 
other ASLV RTs, or mutants, variants or derivatives thereof. In a particular 
embodiment, AMV RT and/or RSV RT is used in accordance with the 
invention. Preferred single subunit RTs include M-MLV reverse transcriptase. 

Production/Sources of cDNA Molecules 
[0221] In accordance with the invention, cDNA molecules (single-stranded or 

double-stranded) may be prepared from a variety of nucleic acid template 
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molecules. Preferred nucleic acid molecules for use in the_present invention 
include single-stranded or double-stranded DNA and RNA molecules, as well 
as double-stranded DNA: RNA hybrids. More preferred nucleic acid 
molecules include messenger RNA (mRNA), transfer RNA (tRNA) and 
ribosomal RNA (rRNA) molecules, although mRNA molecules are the 
preferred template according to the invention. 
1 The nucleic acid molecules that are used to prepare cDNA molecules 

according to the methods of the present invention may be prepared 
synthetically according to standard organic chemical synthesis methods that 
will be familiar to one of ordinary skill. More preferably, the nucleic acid 
molecules may be obtained from natural sources, such as a variety of cells, 
tissues, organs or organisms. Cells that may be used as sources of nucleic acid 
molecules may be prokaryotic (bacterial cells, including but not limited to 
those of species of the genera Escherichia, Bacillus, Serratia, Salmonella, 
Staphylococcus, Streptococcus, Clostridium, Chlamydia, Neisseria, 
Treponema, Mycoplasma, Borrelia, Legionella, Pseudomonas, 
Mycobacterium, Helicobacter, Erwinia, Agrobacterium, Rhizobium, 
Xanthomonas and Streptomyces) or eukaryotic (including fungi (especially 
yeasts), plants, protozoans and other parasites, and animals including insects 
(particularly Drosophila spp. cells), nematodes (particularly Caenorhabditis 
elegans cells), and mammals (particularly human cells)). 

Mammalian somatic cells that may be used as sources of nucleic acids 
include blood cells (reticulocytes and leukocytes), endothelial cells, epithelial 
cells, neuronal cells (from the central or peripheral nervous systems), muscle 
cells (including myocytes and myoblasts from skeletal, smooth or cardiac 
muscle), connective tissue cells (including fibroblasts, adipocytes, 
chondrocytes, chondroblasts, osteocytes and osteoblasts) and other stromal 
cells {e.g., macrophages, dendritic cells, Schwann cells). Mammalian germ 
cells (spermatocytes and oocytes) may also be used as sources of nucleic acids 
for use in the invention, as may the progenitors, precursors and stem cells that 
give rise to the above somatic and germ cells. Also suitable for use as nucleic 
acid sources are mammalian tissues or organs such as those derived from 
brain, kidney, liver, pancreas, blood, bone marrow, muscle, nervous, skin, 
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genitourinary, circulatory, lymphoid, gastrointestinal and connective tissue 
sources, as well as those derived from a mammalian (including human) 
embryo or fetus. 

[0224] Any of the above prokaryotic or eukaryotic cells, tissues and organs 

may be normal, diseased, transformed, established, progenitors, precursors, 
fetal or embryonic. Diseased cells may, for example, include those involved 
in infectious diseases (caused by bacteria, fungi or yeast, viruses (including 
AIDS, HIV, HTLV, herpes, hepatitis and the like) or parasites), in genetic or 
biochemical pathologies (e.g., cystic fibrosis, hemophilia, Alzheimer's disease, 
muscular dystrophy or multiple sclerosis) or in cancerous processes. 
Transformed or established animal cell lines may include, for example, COS 
cells, CHO cells, VERO cells, BHK cells, HeLa cells; HepG2^cells ? K562 
cells, 293 cells, L929 cells, F9 cells, and the like. Other cells, cell lines, 
tissues, organs and organisms suitable as sources of nucleic acids for use in the 
present invention will be apparent to one of ordinary skill in the art. 

[0225] Once the starting cells, tissues, organs or other samples are obtained, 

nucleic acid molecules (such as mRNA) may be isolated therefrom by 
methods that are well-known in the art (See, e.g., Maniatis, T., et al. y Cell 
75:687-701 (1978); Okayama, H., and Berg, P., Mol. Cell. Biol. 2:161-170 
(1982); Gubler, U., and Hoffman, B.J., Gene 25:263-269 (1983)). The nucleic 
acid molecules thus isolated may then be used to prepare cDNA molecules and 
cDNA libraries in accordance with the present invention. 

[0226] In the practice of the invention, cDNA molecules or cDNA libraries 

are produced by mixing one or more nucleic acid molecules obtained as 
described above, which is preferably one or more mRNA molecules such as a 
population of mRNA molecules, with a polypeptide having reverse 
transcriptase activity of the present invention, or with one or more of the 
compositions of the invention, under conditions favoring the reverse 
transcription of the nucleic acid molecule by the action of the enzymes or the 
compositions to form one or more cDNA molecules (single-stranded or 
double-stranded). Thus, the method of the invention comprises (a) mixing one 
or more nucleic acid templates (preferably one or more RNA or mRNA 
templates, such as a population of mRNA molecules) with one or more reverse 
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transcriptases of the invention and (b) incubating the mixture under conditions 
sufficient to make one or more nucleic acid molecules complementary to all or 
a portion of the one or more templates. Such methods may include the use of 
one or more DNA polymerases, one or more nucleotides, one or more primers, 
one or more buffers, and the like. The invention may be used in conjunction 
with methods of cDNA synthesis such as those described in the Examples 
below, or others that are well-known in the art (see, e.g., Gubler, U., and 
Hoffman, B.J., Gene 25:263-269 (1983); Krug, M.S., and Berger, S.L., Meth. 
Enzymol 752:316-325 (1987); Sambrook, J., et al, Molecular Cloning: A 
Laboratory Manual, 2nd ed., Cold Spring Harbor, NY: Cold Spring Harbor 
Laboratory Press, pp. 8.60-8.63 (1989); PCT Publication No. WO 99/15702; 
PCT Publication No. WO 98/47912; and PCT Publication No. WO 98/51699), 
to produce cDNA molecules or libraries. 
[0227] Other methods of cDNA synthesis which may advantageously use the 

present invention will be readily apparent to one of ordinary skill in the art. 
[0228] Having obtained cDNA molecules or libraries according to the present 

methods, these cDNAs may be isolated for further analysis or manipulation. 
Detailed methodologies for purification of cDNAs are taught in the 
GENETRAPPER™ manual (Invitrogen Corporation (Carlsbad, CA)), which is 
incorporated herein by reference in its entirety, although alternative standard 
techniques of cDNA isolation that are known in the art (see, e.g., Sambrook, 
J., et al, Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring 
Harbor, NY: Cold Spring Harbor Laboratory Press, pp. 8.60-8.63 (1989)) may 
also be used. 

[0229] In other aspects of the invention, the invention may be used in methods 

for amplifying and sequencing nucleic acid molecules. Nucleic acid 
amplification methods according to this aspect of the invention may be one- 
step (e.g., one-step RT-PCR) or two-step (e.g., two-step RT-PCR) reactions. 
According to the invention, one-step RT-PCR type reactions may be 
accomplished in one tube thereby lowering the possibility of contamination. 
Such one-step reactions comprise (a) mixing a nucleic acid template (e.g., 
mRNA) with one or more reverse transcriptases of the present invention and 
with one or more DNA polymerases and (b) incubating the mixture under 
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conditions sufficient to amplify a nucleic acid molecule .complementary to all 
or a portion of the template. Such amplification may be accomplished by the 
reverse transcriptase activity alone or in combination with the DNA 
polymerase activity. Two-step RT-PCR reactions may be accomplished in 
two separate steps. Such a method comprises (a) mixing a nucleic acid 
template (e.g., mRNA) with a reverse transcriptase of the present invention, 
(b) incubating the mixture under conditions sufficient to make a nucleic acid 
molecule (e.g., a DNA molecule) complementary to all or a portion of the 
template, (c) mixing the nucleic acid molecule with one or more DNA 
polymerases, and (d) incubating the mixture of step (c) under conditions 
sufficient to amplify the nucleic acid molecule. For amplification of long 
nucleic acid molecules (i.e., greater than* about 3-5 Kb^in length), a 
combination of DNA polymerases may be used, such as one DNA polymerase 
having 3' exonuclease activity and another DNA polymerase being 
substantially reduced in 3' exonuclease activity. 

[0230] Nucleic acid sequencing methods according to this aspect of the 

invention may comprise both cycle sequencing (sequencing in combination 
with amplification) and standard sequencing reactions. The sequencing 
method of the invention thus comprises (a) mixing a nucleic acid molecule to 
be sequenced with one or more primers, one or more reverse transcriptases of 
the invention, one or more nucleotides and one or more terminating agents, (b) 
incubating the mixture under conditions sufficient to synthesize a population 
of nucleic acid molecules complementary to all or a portion of the molecule to 
be sequenced, and (c) separating the population to determine the nucleotide 
sequence of all or a portion of the molecule to be sequenced. According to the 
invention, one or more DNA polymerases (preferably thermostable DNA 
polymerases) may be used in combination with or separate from the reverse 
transcriptases of the invention. 

[0231] Amplification methods which may be used in accordance with the 

present invention include PCR (U.S. Patent Nos. 4,683,195 and 4,683,202), 
Strand Displacement Amplification (SDA; U.S. Patent No, 5,455,166; EP 0 
684 315), and Nucleic Acid Sequence-Based Amplification (NASBA; U.S. 
Patent No. 5,409,818; EP 0 329 822), as well as more complex PCR-based 
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nucleic acid fingerprinting techniques such as _ Random _ Amplified 
Polymorphic DNA (RAPD) analysis (Williams, J.G.K., et al, NucL Acids Res. 
18(22):653l-6535, 1990), Arbitrarily Primed PCR (AP-PCR; Welsh, J., and 
McClelland, M, NucL Acids Res. /^;:7213-7218, 1990), DNA 
Amplification Fingerprinting (DAF; Caetano-Anolles et aL, Bio/Technology 
9:553-557, 1991), microsatellite PCR or Directed Amplification of 
Minisatellite-region DNA (DAMD; Heath, D.D., et al. 9 NucL Acids Res. 
21(24): 5782-5785, 1993), and Amplification Fragment Length Polymorphism 
(AFLP) analysis (EP 0 534 858; Vos, P., et aL, NucL Acids Res. 23(21):A401- 
4414, 1995; Lin, J.J., and Kuo, J., FOCUS 17(2):66-10, 1995). Nucleic acid 
sequencing techniques which may employ the present compositions include 
dideoxy sequencing methods such as those disclosed in iJ.S. Patent Nos. 
4,962,022 and 5,498,523. In a particularly preferred aspects, the invention 
may be used in methods of amplifying or sequencing a nucleic acid molecule 
comprising one or more polymerase chain reactions (PCRs), such as any of the 
PCR-based methods described above. 

Kits 

[0232] In another embodiment, the present invention may be assembled into 

kits, which may be used in reverse transcription or amplification of a nucleic 
acid molecule, or into kits for use in sequencing of a nucleic acid molecule. 
Kits according to this aspect of the invention comprise a carrier means, such as 
a box, carton, tube or the like, having in close confinement therein one or 
more container means, such as vials, tubes, ampoules, bottles and the like, 
wherein a first container means contains one or more polypeptides of the 
present invention having reverse transcriptase activity. When more than one 
polypeptide having reverse transcriptase activity is used, they may be in a 
single container as mixtures of two or more polypeptides, or in separate 
containers. The kits of the invention may also comprise (in the same or 
separate containers) one or more DNA polymerases, a suitable buffer, one or 
more nucleotides and/or one or more primers. The kits of the invention may 
also comprise one or more hosts or cells including those that are competent to 
take up nucleic acids (e.g., DNA molecules including vectors). Preferred 
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hosts may include chemically competent or electrocompetent bacteria such as 
E. coli (including DH5, DH5oc, DH10B, HB101, Top 10, and other K-12 
strains as well as E. coli B and E. coli W strains). 
[0233] In a specific aspect of the invention, the kits of the invention {e.g., 

reverse transcription and amplification kits) may comprise one or more 
components (in mixtures or separately) including one or more polypeptides 
having reverse transcriptase activity of the invention, one or more nucleotides 
(one or more of which may be labeled, e.g., fiuorescently labeled) used for 
synthesis of a nucleic acid molecule, and/or one or more primers (e.g., 
oligo(dT) for reverse transcription). Such kits (including the reverse 
transcription and amplification kits) may further comprise one or more DNA 
polymerases. Sequencing kits of the invention may comprise one or more 
polypeptides having reverse transcriptase activity of the invention, and 
optionally one or more DNA polymerases, one or more terminating agents 
(e.g., dideoxynucleoside triphosphate molecules) used for sequencing of a 
nucleic acid molecule, one or more nucleotides and/or one or more primers. 
Preferred polypeptides having reverse transcriptase activity, DNA 
polymerases, nucleotides, primers and other components suitable for use in the 
reverse transcription, amplification and sequencing kits of the invention 
include those described above. The kits encompassed by this aspect of the 
present invention may further comprise additional reagents and compounds 
necessary for carrying out standard nucleic acid reverse transcription, 
amplification or sequencing protocols. Such polypeptides having reverse 
transcriptase activity of the invention, DNA polymerases, nucleotides, 
primers, and additional reagents, components or compounds may be contained 
in one or more containers, and may be contained in such containers in a 
mixture of two or more of the above-noted components or may be contained in 
the kits of the invention in separate containers. Such kits may also comprise 
instructions (e.g., for performing the methods of the invention such as for 
labeling nucleic acid molecules in accordance with the invention). 
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Use of Nucleic Acid Molecules 

[0234] The nucleic acid molecules or cDNA libraries prepared by the methods 

of the present invention may be further characterized, for example by cloning 
and sequencing (i.e., determining the nucleotide sequence of the nucleic acid 
molecule), by the sequencing methods of the invention or by others that are 
standard in the art (see, e.g., U.S. Patent Nos. 4,962,022 and 5,498,523, which 
are directed to methods of DNA sequencing). Alternatively, these nucleic acid 
molecules may be used for the manufacture of various materials in industrial 
processes, such as hybridization probes by methods that are well-known in the 
art. Production of hybridization probes from cDNAs will, for example, 
provide the ability for those in the medical field to examine a patient's cells or 
tissues for the presence of a particular genetic marker such as a marker of 
cancer, of an infectious or genetic disease, or a marker of embryonic 
development. Furthermore, such hybridization probes can be used to isolate 
DNA fragments from genomic DNA or cDNA libraries prepared from a 
different cell, tissue or organism for further characterization. 

[0235] The nucleic acid molecules of the present invention may also be used 

to prepare compositions for use in recombinant DNA methodologies. 
Accordingly, the present invention relates to recombinant vectors which 
comprise the cDNA or amplified nucleic acid molecules of the present 
invention, to host cells which are genetically engineered with the recombinant 
vectors, to methods for the production of a recombinant polypeptide using 
these vectors and host cells, and to recombinant polypeptides produced using 
these methods. 

[0236] Recombinant vectors may be produced according to this aspect of the 

invention by inserting, using methods that are well-known in the art, one or 
more of the cDNA molecules or amplified nucleic acid molecules prepared 
according to the present methods into a vector. The vector used in this aspect 
of the invention may be, for example, a phage or a plasmid, and is preferably a 
plasmid. Preferred are vectors comprising c/s-acting control regions to the 
nucleic acid encoding the polypeptide of interest. Appropriate ^raws-acting 
factors may be supplied by the host, supplied by a complementing vector or 
supplied by the vector itself upon introduction into the host. 
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[0237] In certain preferred embodiments in this regard, the vectors provide for 

specific expression (and are therefore termed "expression vectors"), which 
may be inducible and/or cell type-specific. Particularly preferred among such 
vectors are those inducible by environmental factors that are easy to 
manipulate, such as temperature and nutrient additives. 
[0238] Expression vectors useful in the present invention include 

chromosomal-, episomal- and virus-derived vectors, e.g., vectors derived from 
bacterial plasmids or bacteriophages, and vectors derived from combinations 
thereof, such as cosmids and phagemids, and will preferably include at least 
one selectable marker such as a tetracycline or ampicillin resistance gene for 
culturing in a bacterial host cell. Prior to insertion into such an expression 
vector, the cDNA or amplified nucleic acid molecules oftthe invention should 
be operatively linked to an appropriate promoter, such as the phage lambda P L 
promoter, the E. coli lac, trp and tac promoters. Other suitable promoters will 
be known to the skilled artisan. 
[0239] Among vectors preferred for use in the present invention include 

pQE70, pQE60 and pQE-9, available from Qiagen; pBS vectors, Phagescript 
vectors, Bluescript vectors, pNH8A, pNH16a, pNH18A, pNH46A, available 
from Stratagene; pcDNA3 available from Invitrogen; pGEX, pTrxfus, 
pTrc99a, pET-5, pET-9, pKK223-3, pKK233-3, pDR540, pRIT5 available 
from Pharmacia; and pSPORTl, pSPORT2 and pSV*SPORTl, available from 
Invitrogen Corporation (Carlsbad, CA). Other suitable vectors will be readily 
apparent to the skilled artisan. 
[0240] The invention also provides methods of producing a recombinant host 

cell comprising the cDNA molecules, amplified nucleic acid molecules or 
recombinant vectors of the invention, as well as host cells produced by such 
methods. Representative host cells (prokaryotic or eukaryotic) that may be 
produced according to the invention include, but are not limited to, bacterial 
cells, yeast cells, plant cells and animal cells. Preferred bacterial host cells 
include Escherichia coli cells (most particularly E. coli strains DH10B and 
Stbl2, which are available commercially (Invitrogen Corporation (Carlsbad, 
CA)), Bacillus subtilis cells, Bacillus megaterium cells, Streptomyces spp. 
cells, Erwinia spp. cells, Klebsiella spp. cells and Salmonella typhimurium 
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cells. Preferred animal host cells include insect cells (most particularly 
Spodoptera frugiperda SJ9 and SJ21 cells and Trichoplusa HigH-Five cells) 
and mammalian cells (most particularly CHO, COS, VERO, BHK and human 
cells). Such host cells may be prepared by well-known transformation, 
electroporation or transfection techniques that will be familiar to one of 
ordinary skill in the art. 
[0241] In addition, the invention provides methods for producing a 

recombinant polypeptide, and polypeptides produced by these methods. 
According to this aspect of the invention, a recombinant polypeptide may be 
produced by culturing any of the above recombinant host cells under 
conditions favoring production of a polypeptide therefrom, and isolation of the 
polypeptide. Methods for culturing recombinantchost cells* and for production 
and isolation of polypeptides therefrom, are well-known to one of ordinary 
skill in the art. 

[0242] It will be readily apparent to one of ordinary skill in the relevant arts 

that other suitable modifications and adaptations to the methods and 
applications described herein are obvious and may be made without departing 
from the scope of the invention or any embodiment thereof. Having now 
described the present invention in detail, the same will be more clearly 
understood by reference to the following examples, which are included 
herewith for purposes of illustration only and are not intended to be limiting of 
the invention. 

EXAMPLES 

Example 1: Preparation of Mutant Reverse Transcriptases 

[0243] Plasmid pBAD was obtained from Invitrogen Corporation, Carlsbad, 

CA and the coding sequence of M-MLV reverse transcriptase was inserted to 
produce plasmid pBAD-6-His-M-MLV H- (Fl). Plasmid pBAD-6-His-M- 
MLV H-(F1) was used as both a cloning vector and as a target for PCR 
mutagenesis (Fig. 1). pBAD-6-His-M-MLV H- (Fl) replicates in E. coli and 
confers ampicillin resistance to transformed cells. The M-MLV reverse 
transcriptase gene is expressed from the ara BAD promoter which is induced 
by the presence of arabinose. The promoter is repressed by the product of the 
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araC gene, which is present on the plasmid. The host used, E. coli strain 
DH10B, is an araD mutant and cannot metabolize arabinose, making 
arabinose a gratuitous inducer in DH10B cells transformed with pBAD-6-His- 
M-MLV H-(F1). The plasmid contains a 6 histidine containing leader 
sequence in frame with the coding sequence of the M-MLV reverse 
transcriptase gene. The gene starting at nucleotide 2598 and ending at 
nucleotide 4628 (Shinnick, et a/., (1981) Nature 293, 543-548.) was cloned 
under control of an araD promoter into plasmid pBAD/HisA (Invitrogen). 
The M-MLV gene was further modified by site-directed mutagenesis without 
changing amino acid coding to include several unique restriction endonuclease 
sites that divided the gene into five segments (Fig. 2). The amino end of the 
protein contained a His 6 tag to simplify purification that included the 
following amino acids: 

MGGSHHHHHHGMASMTGGQQMGRDLYDDDDKH corresponding to 
amino acids 1-32 of SEQ ID NO:2. The carboxy end of the protein contained 
the additional amino acids NSRLIN, corresponding to amino acids 71 1-716 of 
SEQ ED NO:2, present as the result of subcloning from pRT60L In addition, 
the M-MLV RT gene was mutated (D524G, E562Q, D583N) to eliminate 
RNase H activity. The final construct was designated pBAD-HSS2 (Fig. 1), 
and the gene and gene product were designated His 6 H- RT. In addition to this 
construct, other constructs having different N-terminal sequences are 
contemplated in the present invention. For example, a construct beginning at 
methionine 12 of SEQ ID NO:6 and Table 3 and containing a mutation 
changing methionine 15 to glycine (M15G) to produce a protein with an N- 
terminal sequence MASGTGGQQMGRDLYDDDDKH (SEQ ID NO:l) 
followed by the remaining sequence of M-MLV RT from Table 3 has been 
produced as well as a construct beginning with methionine 33 of SEQ ID 
NO:6and Table 3. 

With reference to the sequence of this plasmid provided in Table 3 
(SEQ ID NOs:l and 2), nucleotides 1-96 encode the leader sequence and 
nucleotides 97-99 encode a methionine. Those skilled in the art will 
appreciate that the wild-type M-MLV reverse transcriptase is derived by 
proteolysis from a precursor polyprotein and thus the wild-type M-MLV 
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reverse transcriptase does not start with a methionine. Therefore, amino acid 
number 1 of the M-MLV reverse transcriptase is the threonine (amino acid 34 
in SEQ ID NO:2 and Table 3) following the methionine encoded by 
nucleotides 97-99 (amino acid 33 in SEQ ID NO:2 and Table 3). 
[0245] The sequence of the M-MLV reverse transcriptase gene in pBAD-6- 

His-M-MLV H- (Fl) which was used in these experiments was derived from 
the sequence of plasmid pRT601. pRT601 is described in Kotewicz, et al., 
(1988) Nuc. Acids Res. 16, 265-277, Gerard, et al, (1986) DNA 5, 271-279, 
U.S. Patent Nos. 5,668,005 and 5,017,492, which are incorporated herein by 
reference in their entireties. 
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Table 3 (SEQ ID NOs:l and 2). 

1 atggggggtt ctcatcatca tcatcatcat 
mgg shh hhhh 

61 atgggtcggg atctgtacga cgatgacgat 
mgr dly dddd 

121 tatcggctac atgagacctc aaaagagcca 
yrl het skep 

181 gattttcctc aggcctgggc ggaaaccggg 
dfp qaw aetg 

241 ctgatcatac ttctgaaagc aacctctacc 
lii Ilk atst 

3 01 caagaagcca gactggggat caagccccac 
qea rig ikph 

361 gtaccctgcc agtccccctg gaacacgccc 
vpc qsp wnt 'p 

421 gattacaggc ctgtccaaga tctgagagag 
dyr pvq dire 

481 accgtaccca acccctacaa cctcttgagt 
tvp npy nils 

541 gttctagact taaaagatgc ctttttctgc 
vld lkd affc 

601 ttcgcctttg aatggagaga cccagagatg 
faf ewr dpem 

661 ctcccacagg gattcaaaaa cagtcccacc 
lpq gfk nspt 

721 gcagacttcc ggatccagca cccagacttg 
adf riq hpdl 

781 ctggccgcca cttctgagct cgactgccaa 
laa tse ldcq 

841 ggagacctcg ggtatcgggc ctcggccaag 
gdl gyr asak 

901 tatctggggt atcttctaaa agagggtcag 
ylg yll kegq 

961 gtgatggggc agcctactcc gaagaccccg 
vmg qpt pktp 

1021 ggcttctgtc gcctctggat ccctgggttt 
gfc rlw ipgf 

1081 accaaaacgg ggactctgtt taattggggc 
tkt gtl fnwg 

1141 aagcaagctc ttctaactgc cccagccctg 
kqa lit apal 



ggtatggcta gcatgactgg tggacagcaa 

gma smt g g q q 

aagcatatga ccctaaatat agaagatgag 
khm tin iede 

gatgtttctc tagggtccac atggctgtct 
dvs igs twls 

ggcatgggac tggcagttcg ccaagctcct 
gmg lav rqap 

cccgtgtcca taaaacaata ccccatgtca 
pvs ikq ypms 

atacagagac tgttggacca gggaatactg 
iqr lid qgil 

ctgctacccg tcaagaaacc cgggactaat 
1 '~ 1 p" "v k k p g t n 

gtcaacaaac gcgtagaaga catccacccc 
vnk rve dihp 

gggctcccac cgtcccacca gtggtacact 
glp psh qwyt 

ctgagactcc acccgacgtc tcagcctctc 
lrl hpt sqpl 

ggaatctctg gccaactaac ctggaccaga 
gis gql twtr 

ctgtttgatg aggcactgcg cagagaccta 
lfd eal rrdl 

atcctgctac agtacgtaga tgacttactg 
ill qyv ddll 

caaggtactc gggccctgtt acaaacccta 
qgt ral lqtl 

aaagcccaaa tttgccagaa acaggtcaag 
kaq icq kqvk 

agatggctga ctgaggccag aaaagagact 
rwl tea rket 

cggcaactaa gggagttcct agggacggca 
rql ref lgta 

gcagaaatgg cagccccctt gtaccctctc 
aem aap lypl 

ccagaccaac aaaaggccta tcaagaaatc 
pdq qka yqei 

gggttgccag atttgactaa gccctttgaa 
glp dlt kpfe 
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Table 3 (cont.) 

1201 ctctttgtcg acgagaagca gggctacgcc aaaggtgtcc taacgcaaaa actgggacct 
lfv dek qgya kgv ltq klgp 

1261 tggcgtcggc cggtggccta cctgtccaaa aagctagacc cagtagcagc tgggtggccc 
wrr pva ylsk kid pva agwp 

1321 ccttgcctac ggatggtagc agccattgcc gtactgacaa aggatgcagg caagctaacc 
pel rmv aaia vlt kda gklt 

1381 atgggacagc cactagtcat tctggccccc catgeagtag aggcactagt caaacaaccc 
mgq plv ilap hav eal vkqp 

1441 cccgatcgat ggctttccaa cgcccggatg actcactatc aggecttget tttggacacg 
pdr wis narm thy qal lldt 

1501 gaccgggtcc agttcggacc ggtggtagcc ctgaacccgg ctacactgct cccactgcct 
drv qfg pvva lnp atl lplp 

1561 gaggaagggc tgcagcacaa ctgecttgat atcctggccg ; aagcccacgg aacccgaccc 
eeg lqh ncld ila eah gtrp 

1621 gaectaaegg accagccgct cccagacgcc gaccacacct ggtacacggg tggatccagt 
dlt dqp lpda dht wyt ggss 

1681 etcttgeaag agggacagcg taaggcggga gctgcggtga ccaccgagac cgaggtaatc 
llq egq rkag aav tte tevi 

1741 tgggctaaag ccctgccagc cgggacatcc getcageggg ctcagctgat agcactcacc 
wak alp agts aqr aql ialt 

1801 caggccctaa ggatggcaga aggtaagaag ctaaatgttt atacgaattc ccgttatgct 
qal rma egkk lnv ytn srya 

1861 tttgctactg cccatatcca tggagaaata tacagaaggc gtgggttgct cacatcagaa 
fat ahi hgei yrr rgl ltse 

1921 ggcaaagaga tcaaaaataa ggacgagata ttggccctac taaaagcect ctttctgccc 
gke ikn kdei lal lka lflp 

1981 aaaagactta gcataatcca ttgtccagga catcaaaagg gacacagcgc cgaggctaga 
krl sii hepg hqk ghs aear 

2 041 ggcaaccgga tggctgacca agcggcccga aaggcageca tcacagagaa tccagacacc 
gnr mad qaar kaa ite npdt 

2101 tctaccctcc tcatagaaaa ttcatcaccc aattcccgct taattaatta a 
stl lie nssp nsr lin 



[0246] Table 4 provides a list of the point mutations introduced in the M- 

MLV reverse transcriptase coding sequence of pRT601 to produce the plasmid 
used. The numbering of the point mutations corresponds to the nucleotide 
sequence presented in Table 3. 
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1675 


aj 


c o c 
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g.c 


801 
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1950 
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[0247] The mutations which were introduced to make RNase H- mutants of 

M-MLV reverse transcriptase are D524G, D583N, and E562Q, The 
remaining mutations were introduced to insert or remove restriction enzyme 
sites to facilitate the production of appropriately sized segments for the 
random PCR mutagenesis. This RNase H- mutant is referred to herein as 
Superscript™ II or Superscript™ II gene. 

[0248] The sequence of the M-MLV reverse transcriptase was engineered to 

introduce restriction enzyme cleavage sites as shown schematically in Fig. 2 
without changing the amino acids encoded by the sequence. The sequence 
was divided into 5 segments and oligonucleotides were designed so that each 
segment could be amplified. The segments roughly corresponded to the 
coding sequences of the five separate structural subdomains of RT 
(Kohlstaedt, et al. 9 (1992) Science 256, 1783-1789, Jacobo-Molina, et ah, 
(1993) Proc. Natl Acad. Set USA 90, 6320-6324). Segments one through 
four corresponded to the polymerase subdomain of fingers, palm, thumb, and 
connection, respectively, and segment five corresponded to the RNase H 
domain (Fig. 2). An upper limit cut-off of 1 to 2 mutations per segment was 
set as the target for mutation frequency to suppress accumulation of 
deleterious mutations, and to minimize the amount of screening required to 
find active mutants. Mutation frequencies of > 5 mutations/segment in 
segment one, two, or three produced only about 5% active mutants with all the 
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mutants having less than wild-type activity. At the mutation frequency used 
of 1 to 2 mutations per segment, approximately one-third of the mutants had 
little or no activity, one-third had less than 50% of His 6 H- RT activity, and 
one-third had up to 100% of His 6 H- RT activity. 

[0249] Segments were prepared from pBAD-6-His-M-MLV H- (Fl) by 

restriction enzyme digests and the segments were gel purified away from the 
vector backbone. Each segment was randomly mutagenized by PCR in the 
presence of manganese. The PCR conditions were standard except that 0.25 
mM MnCb was present, and the nucleotide triphosphate concentration was 
limited to 20 |aM of each dNTP (50 mM TrisHCl pH 8.3, 50 mM KC1, 3 mM 
MgCl 2 , 20 ^iM dGTP, 20 ^iM dCTP, 20 jaM dATP, 20 |aM dTTP, 1 unit Taq 
DNA polymerase per 100 jil reaction). The PCR product was extracted with 
phenol-chloroform, precipitated with ethanol and the mutated segments were 
cloned into a vector from which the given segment had been removed. 

[0250] In some random mutagenesis experiments, mutagenic PCR was 

performed in a reaction mixture (100 jixl) containing 20 mM Tris-HCl (pH 
8.4), 50 mM KC1, 1.8 mM MgCl 2 , 0.3 mM MnS0 4 , 200 jaM each of dCTP, 
dGTP, dTTP, and dATP, and 0.5 units of Taq DNA polymerase (Invitrogen 
Corporation, Carlsbad, CA). After a 1 min denaturation step at 94 °C, the 
cycling protocol was 15 sec at 94 °C, 15 sec at 55 °C, and 30 sec at 72 °C for 
20 cycles. Amplification was 100 fold from 50 ng of target to 5 |ng of 
amplified product. PCR primers included appropriate restriction endonuclease 
cut sites. An amplified DNA segment was cleaved with appropriate restriction 
endonucleases, gel purified, and cloned into gel purified vector DNA cut with 
the corresponding restriction enzymes. The vectors containing the mutated 
segments were transformed into appropriate host cells to produce a library. 

[0251] Libraries were sampled by DNA sequencing of the mutagenized H- RT 

gene segment of a small number of clones to determine the mutation 
frequency. The goal was to achieve rates of PCR random mutagenesis that 
produced 1 to 2 mutations per segment. It was found that random mutagenesis 
that produced greater than 2 mutations per segment tended to produce a large 
proportion of inactive RT mutants. If the library met this criterion, further 
screening by heat treatment was carried out to identify mutants that showed 



- 114- 

greater RT activity in lysates than H- RT after a heat treatment step. Those 
mutants with the highest apparent thermal stability were screened again in 
duplicate by pretreatment at 24 °C (to normalize for activity) and at 52 to 
58°C to confirm the presence of enhanced thermal stability. In all, about 
15,000 clones were screened by heat treatment in the 96-well format for each 
segment or a total of about 100,000 mutants. Libraries of transformants for 
each mutated segment were screened for thermostable variants. 

Example 2: Screening for Thermostable Reverse Transcriptases 

[0252] In this example the following solutions were used: 

EG-per liter: 20 g bacto-tryptone, 10 g bacto yeast extract, 2 ml glycerol, 0.54 
gNaCl, 0.194 gKCl 

EG-arabinose-150 ml EG plus 1.5 ml of 10 mg/ml ampicillin and 1.5 ml of 
20% (w/v) arabinose (if plates are to have arabinose) 

20X PEB-I Buffer-18% (w/v) glucose, 500 mM Tris-HCl (pH 8.0), 200 mM 
EDTA 

Kinase Storage Buffer-50% (v/v) glycerol, 20 mM Tris-HCl (pH 8.0), 100 
mM KC1, 5 mM pME 

100 mg/ml lysozyme- made in Kinase Storage Buffer and stored at -20°C 
2X PLD-5 ml of 20X PEB-I, 1 ml of 1 M DTT, 5 ml of 10% (v/v) Triton X- 
100, 1 ml of 100 mg/ml lysozyme and 38 ml of water 

2X PZD- 0.5 ml of 20X PEB-I, 100 ul of 1 M DTT, 0.5 ml of 10% (v/v) 
Triton X-100, 10 ul of zymolase and 3.9 ml of water 

10X Poly(C) Reaction Buffer-500 mM Tris-HCl (pH 8.4), 500 mM KC1, 100 
mM MgCl 2 

1.25X Reaction Mix-1 ml of 10X Poly(C) Reaction Buffer, 100 ul of 1 M 
DTT, 1 ml of poly(C)/oligo(dG) (30 mM/12 mM in nucleotide), 10 ul of 100 
mM dGTP, 5.87 ml of water and 20 ul of [a- 32 P] dGTP at 10 uCi/ul 
[0253] E. coli DH10B (Invitrogen Corporation, Carlsbad, CA) was used in all 

experiments. Bacterial liquid cultures were grown at 37 °C in EG: 2% 
tryptone, 1% yeast extract, 0.5% glycerol, 10 mM NaCl, and 1 mM KC1. Solid 
medium was LB (1% bacto-tryptone, 0.5% yeast extract, and 86 mM NaCl) + 
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1.5% agar. Selective media included 100 |ng/ml ampicillin. For induction of 
cultures, cells were inoculated into selective EG + 0.2% arabinose and grown 
for 18 hr. 

[0254] Mutant populations were plated on selective agar. Individual 

transformant colonies were inoculated into single wells of a 96 well culture 
plate. Each well contained 120 \xl of EG-Ap medium (EG medium with 100 
jag/ml ampicillin). Although colonies from the selective agar may be grown in 
media containing 0.2% arabinose, it is preferable to first inoculate a 96 well 
plate with selective medium without the inducer, to grow that master plate 
overnight, and then to make a replica of the master plate into a 96-well plate 
with the inducer and grow that plate overnight. 

[0255J An aliquot of the un-induced culture from each well was transferred 

into a new 96-well plate containing 120 jal per well of selective media + 0.2% 
arabinose. The cultures containing the inducer were grown overnight (e.g., 
15-20 hours) at 37°C without shaking to induce RT production. An aliquot (5 
jj,1) of culture from each well was then transferred into another 96-well plate 
containing 5 per well of 2XPLD(50 mM Tris-HCl, pH 8.0, 20 mM EDTA, 
1.8% (w/v) sucrose, 1% (v/v) Triton X-100, 10 mM DTT, and 2 mg/ml 
lysozyme) at room temperature. These extracts were sometimes assayed 
directly for reverse transcriptase before the heating step. The amount of RT 
activity in a 5-jal aliquot of extract was within the linear range of the assay. 
Lysates were stable at room temperature for at least 1 hr. 

[0256] The extracts were heated, for example, using a water bath or 

thermocycler, for 5 or 10 minutes at temperatures that ranged from 50°C to 
60°C. Preferably, the cultures were heated for 5 minutes at 52°C. After 
heating and cooling to room temperature, RT activity in a 5-jal aliquot from 
the lysate in each well was assayed with (rC) n «(dG)is in another 96-well plate. 
An aliquot (5 to 10 pi) of the extract was mixed with 1.25X RT reaction mix. 
This reaction was placed in a 37°C water bath for 10 minutes. A small aliquot 
of the reaction mixture (5 jal) was spotted onto a charged nylon membrane 
(Genescreen+, NEN). The membrane was washed twice with 10%TCA +1% 
sodium pyrophosphate, rinsed with ethanol, dried, and placed next to a 
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phosphor screen. As an alternative, the membrane may be washed twice with 
4% sodium pyrophosphate (pH 8.0), rinsed with ethanol, dried, and then 
placed next to a phosphor screen. Radioactive product that had been trapped 
on the filter was detected by analyzing the screen in a Posphorimager, using 
ImageQuant software (Molecular Devices). 

[0257] Candidates were selected if they showed more reverse transcriptase 

activity (radioactivity) after the heat inactivation step. These candidates were 
screened a second time to confirm the phenotype. Candidates which appeared 
to be thermostable after the second screen were grown in small cultures and 
tested a third time for thermostable reverse transcriptase activity. Candidates 
that were reproducibly heat resistant were .sequenced and the mutation in each 
clone was determined. 

[0258] Plasmid DNA was prepared from an over night E. coli culture bearing 

an RT mutant using a Concert High Purity Miniprep Kit (Invitrogen 
Corporation, Carlsbad, CA) following the manufacturer's instructions. Each 
DNA was sequenced using a forward and reverse primer bordering the 
segment that had been mutagenized to generate the mutant. The sequencing 
reactions were carried out as specified for plasmid DNA using the ABI Big 
Dye Terminator Sequencing Ready Reaction Kit. The reactions were 
analyzed using an ABI PRISM 377 DNA Sequencer. 

[0259] An oligonucleotide corresponding to the mutagenized site was 

designed in which the codon for the mutagenized amino acid was randomized 
(NNK or NNN). Oligonucleotide site-directed mutagenesis was carried out by 
established procedures. Saturation of an amino acid coding site in the H- RT 
gene with all possible amino acids was performed by introducing the sequence 
NNK (N = A,C,G or T and K = G or T) at the codon site into the mutagenic 
oligonucleotide. These oligonucleotides were used in site-directed 
mutagenesis to generate a library in which all possible substitutions at the 
mutagenized site were made. This library was screened for thermostable 
reverse transcriptase activity, and the most promising clones were sequenced. 

[0260] Screening of mutants in Segment 2 (see Fig. 2) resulted in the 

identification of one mutant, H204R. Screening of a library mutagenized at 
site H204 resulted in several mutants, but the only one that was more 
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thermostable than M-MLV reverse transcriptase was another H204R mutant. 
H204R mutants of M-MLV reverse transcriptase have enhanced 
thermostability. Screening of mutants in segment 3 (see Fig. 2) resulted in one 
mutant, T306K. Randomization of the T306 position produced thermostable 
mutants which, when sequenced, were T306R. Both T306K and T306R 
mutants of M-MLV reverse transcriptase have about 1.5 fold enhanced 
thermostability. 

Example 3: TdT Reverse Transcriptase Mutants 

[0261] In checking fidelity mutants pf reverse transcriptase (RT) for 

misextension in a 3 dNTP assay, it was observed that SUPERSCRIPT™ II 
reverse transcriptase extended 2-3 bases past the end of the template in the 
presence of 3 and 4 dNTPs. This non-template directed extension or TdT 
activity is reduced in many mutants, but in a few such as F309N and T197E it 
appears that this activity is severely reduced or eliminated. These mutants are 
probably in close proximity or in contact with the template-primer as 
determined by homology to HIV reverse transcriptase and its crystal structure 
with bound template-primer. 

METHODS 

Mutagenesis 
[0262] For F309N: 

[0263] Primers were designed corresponding to the mutant position F309 with 

the silent insertion of a NgoMTV restriction site at amino acid positions 310- 
311. The primers encoded a random NNK sequence for this position 
generating a random library of F309 mutants, where N is any of the four bases 
and K is T or G. The primers along with internal SUPERSCRIPT™ II reverse 
transcriptase primers at an upstream SstI restriction site and a downstream Sail 
restriction site were used in a standard PCR reaction (10 ng SUPERSCRIPT™ II 
reverse transcriptase template, 2 jaM of each primer, 48 ^il SuperMix 
(Invitrogen Corporation (Carlsbad, CA)) for 20 cycles of 94°C 15 sec, 55°C 15 
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sec, 72°C 30 sec) to generate two PCR fragments. These were a 240 base pair 
Sstl-NgoMTV fragment and a 200 base pair A^oMIV-Sall fragment. The 
fragments were isolated and digested and ligated together and then inserted 
into the original SUPERSCRIPT™ II reverse transcriptase clone cut with SstI 
and Sail, The resulting ligation product was transformed in Max Efficiency 
DH10B (Invitrogen Corporation (Carlsbad, CA)) competent cells to create the 
library of mutants at site F309. This library was then plated overnight for 
selection. 

ForT197E andY133A: 

The mutants T197E and Y133A were made by oligo-directed 
mutagenesis as described in Kunkel, T.A. et al Methods Enzymol 204:125 
(1991). Briefly, the SUPERSCRIPT™ II reverse transcriptase gene was inserted 
into pBADhisA (Invitrogen Corporation) vector and named pBAD-SSII. This 
plasmid was transformed into DH11S cells and the cells were infected with 
M13K07 helper phage from which single strand DNA was isolated. Oligos 
were designed corresponding to each mutation: T197E and Y133A. Each 
oligo (100 ^iM) was kinased with T4 polynucleotide kinase (Invitrogen 
Corporation (Carlsbad, CA)) using the Forward Reaction Buffer (Invitrogen 
Corporation (Carlsbad, CA)). The oligo was annealed to single stranded 
pBAD-SSII DNA. Native T7 DNA polymerase (USB) and T4 DNA ligase 
(Invitrogen Corporation (Carlsbad, CA)) were added with synthesis buffer (0.4 
mM dNTPs, 17.5 mM Tris-HCl, pH 7.5, 5 raM MgCl 2 , 2.5 mM DTT, and 1 
mM ATP) to the annealed reaction on ice. The reactions were incubated at 
37°C for 30 minutes and terminated by adding 1 |il of 0.5 M EDTA. The 
reactions were transformed and plated with DH10B cells. Colonies were 
picked and mutants were determined by restriction enzyme analysis and 
sequenced for confirmation using an ABI 377 instrument and ABI Big Dye 
Terminator Cycle Sequencing Ready Reaction kit. 

Selecting Colonies Containing Active Reverse Transcriptase. 

Individual transformant colonies were inoculated into single wells of a 
96 well culture plate. Each well contained 120 jul of media (EG-Ap) 
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containing 0.2% arabinose. It is preferable to first inoculate a 96 well plate 
with selective medium without the inducer, to grow that master plate 
overnight, and then to make a replica of the master plate into a 96-well plate 
with the inducer and grow that plate overnight. The cultures were grown 
overnight at 37°C without shaking. Overnight cultures were mixed with an 
equal volume of 2X PLD (1.8% glucose, 50 mM Tris-HCl, pH 8.0, 20 mM 
EDTA, 20 mM DTT, 1% Triton X-100, 2 mg/mL lysozyme) at room 
temperature. These extracts were assayed directly for reverse transcriptase 
activity by mixing 10 |ul of the extract with 40 jal of 1.25X RT reaction mix 
(62.5 mM Tris-HCl, pH 8.4, 62.5 mM KC1, 12.5 mM MgCl 2 , 12.5 mM DTT, 
1.25 mM dGTP, polyC/oligo dG (3.75 mM/1.5 mM in nucleotide), [ 32 P] 
dGTP). This reaction was placed in a 37°C water bath for 10 minutes. A 
small aliquot of the reaction mixture (5 pi) was spotted onto a charged nylon 
membrane (Genescreen+, NEN). The membrane was washed twice with 
10%TCA + 1% sodium pyrophosphate, rinsed with ethanol, dried, and placed 
next to a phosphor screen. Radioactive product that had been trapped on the 
filter was detected by analyzing the screen in a Phosphorimager, using 
ImageQuant software (Molecular Devices). Candidates were selected if they 
showed reverse transcriptase activity (radioactivity). These candidates were 
screened a second time to confirm the phenotype. The confirmed candidates 
were then sequenced to determine which amino acids maintained detectable 
reverse transcriptase activity. 

Purification of Reverse Transcriptase Mutants. 

[0267] The cell pellet containing induced reverse transcriptase was suspended 

in a ratio of 2 mL Lysis buffer (40 mM Tris-HCl, pH 8.0, 0.1 M KC1, 1 mM 
PMSF)/1 gram of cell pellet. The suspension was sonicated on ice and then 
centrifuged at 27,000 g for 30 minutes. The cell-free extract was filtered 
through a 0.45 [i syringe filter. The cell-free extract was applied to a 5 mL 
Ni HI-TRAP column (Pharmacia) pre-equilibrated with 5 volumes 5 mM 
imidazole in buffer A (40 mM Tris HC1, pH 8.0, 10% glycerol, 0.01% Triton 
X-100, 0.1 M KC1) at 1 mL/min. The column was washed with 10 volumes 5 
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mM imidazole in buffer A. The reverse transcriptase was eluted by washing 
with 20 volumes of a gradient of 5 mM to 1M imidazole in buffer A. The 
eluate containing reverse transcriptase protein was applied to a 1 mL Mono-S 
column (Pharmacia) pre-equilibrated with 10 column volumes 50 mM KC1 in 
buffer B (40 mM Tris-HCl, pH 8.0, 10% glycerol, 0.01% Triton X-100, 0.1 
mM EDTA, 1 mM DTT) at a flow rate of 1.0 mL/min. The column was 
washed with 10 volumes of 50mM KC1 in buffer B. Reverse transcriptase was 
eluted with 20 volumes of a gradient from 50 mM to 1 M KC1 in buffer B. 
The individual fractions were analyzed for RT activity. The fraction 
containing peak RT activity was dialyzed against storage buffer (40 mM Tris- 
HCl, pH 8.0, 50% glycerol, 0.01% Triton X-100, 0.1 mM EDTA, 1 mM DTT, 
0.1 M KC1). The purified reverse transcriptases were more than 95% pure, as 
judged by SDS-PAGE. The protein concentrations were determined by using 
the Biorad colorimetric kit. 

3 dNTP Assay Method. 

[0268] Procedures were modified from those of Preston, B.D., et ah Science 

242: 1 168 (1988). The DNA template-primer was prepared by annealing a 47- 
mer template (5'- 

GAGTTACAGTGTTTTTGTTCCAGTCTGTAGCAGTGTGTGAATGGAA 
G-3') (SEQ ID NO:6) to an 18-mer primer (5'-CTTCCATTCACACACTGC- 
3') (SEQ ID NO:7) [ 32 P]-labeled at the 5'-end with T4 polynucleotide kinase 
(template:primer, 3:1). Assay mixture (10 jil) contained 5 nM template- 
primer, 50-200 nM reverse transcriptase as specified in figure legends, 3 or 4 
dNTPs (250 (aM each), 50 mM Tris-HCl (pH 8.3), 75 mM KC1, 3 mM MgCl 2 , 
10 mM DTT. Reactions were incubated at 37°C for 30 minutes and 
terminated by the addition of 5 jxl of 40 mM EDTA, 99% formamide. 
Reaction products were denatured by incubating at 95°C for 5 minutes and 
analyzed by electrophoresis on urea 6% polyacrylamide gels. 
[0269] To determine if any TdT activity was occurring in the control reaction 

of the 3 dNTP assay, which uses all 4 dNTPs, the control reaction was 
repeated with varying amounts of enzyme, >600 units to 20 units, at 37°C for 
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30 minutes. For SUPERSCRIPT™ II, T197E, and Y133A, 200, 100, 50, and 20 
units were used. For F309N, 646, 200, 50, and 20 units were used. 

RESULTS 

[0270] We carried out a misinsertion assay of F309N (H204R, T306K) 

Superscript™ II reverse transcriptase, hereafter referred to as F309N, with 
DNA template. This assay was employed to compare the misincorporation 
capability of the mutant to Superscript™ II. The assay is a primer extension 
assay using synthetic DNA template-primer and biased dNTP pools containing 
only three of four dNTPs. The reactions are displayed on a gel in Fig. 3. 
While conducting this procedure to screen for mutants with lower 
misinsertion/misextension rates it was observed that Superscript™ II reverse 
transcriptase extended 2-3 nucleotides past the template end and that some 
mutations reduced or appeared to eliminate this non-template directed 
extension or TdT activity. As shown in Fig. 4, in the presence of all 4 dNTPs, 
Superscript™ II reverse transcriptase and the mutant F309N were able to 
extend the primer approximately equally, with SUPERSCRIPT™ II reverse 
transcriptase adding 2 nucleotides past the template, and F309N adding none 
beyond the end of the template. To further evaluate this non-templated 
directed extension the control reaction for the 3 dNTP misextension assay 
containing all 4 dNTPs was carried out with Superscript™ II, F309N, 
T197E, and Y133A reverse transcriptase for 30 minutes with varying amounts 
of enzyme. The three mutants had shown very reduced levels of TdT activity 
in prior screens. Since it had been observed that 5 minutes with 20 units of 
enzyme was more than enough time for the primer extension to be completed, 
a 30 minute incubation and 200 to 646 units of reverse transcriptase were both 
in large excess over what was necessary for the reaction to be completed. As 
seen in Fig. 4, all the reverse transcriptase reactions at the lowest amount 
tested had similar extension products to the reactions at the highest unit 
concentrations demonstrating that the reaction had gone to completion. 
Superscript™ II reverse transcriptase added 2 nucleotides past the end of the 
template, F309N and T197E did not extend past the end of the template, and 
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Y133A appears to have a small amount of product that is 1 nucleotide past the 
end of the template. 



Example 4: Dual Thermostable and TdT Mutants 



[0271] The F309 amino acid position in M-MLV reverse transcriptase (RT) 

aligns with the W266 position in HIV reverse transcriptase. This position is at 
the base of the thumb domain and is considered part of the minor groove 
binding tract which interacts with the minor groove of the template-primer. 
The mutations H204R and T306K have been shown to increase the 
thermostability of the enzyme. The F309N mutation in an H204R/T306K 
clone displays 2.3x lower mutation frequency in a lacZ forward assay (Table 
5) on RNA template and shorter extension products in a 3 dNTP extension 
assay than Superscript™ II reverse transcriptase or H204R/T306K in 
Superscript™ II reverse transcriptase. Both findings support the claim of an 
enzyme with higher fidelity (Table 6). 

Table 5 

Mutation Frequency of M-MLV Reverse Transcriptase High Fidelity Mutants 

Construct total plaques mutant plaques MF(x 10" 4 ) 

Superscript™ II 15689 87 39 

Superscript™ II 14410 83 41 

(H204R, T306K) 

Superscript™ II 11623 39 17 

(H204R, T306K,F309N) 

Superscript™ II 11415 39 14 

(H204R,T306K, 

F309N,V223H) 

Table 5. The mutation frequency of Superscript™ II reverse transcriptase 
and point mutants. Mutation frequency (MF) was determined by dividing the 
number of mutant plaques (light blue or white) by the total number of plaques. 
The background mutant frequency of the starting DNA was 17 x 10" 4 for the 
first 3 constructs and 20 x 10 -4 for the last construct. 
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Table 6 

Error Rates of M-MLV Reverse Transcriptase High Fidelity Mutants 







Superscript™ II 


F309N 


V223H/F309N 


Overall ER (oER) 


1/17,000 


1/15,000 


1/34,000 


1/41,000 


Mismatch 
% of total 
ER (mER) 


46 

1/37,000 


35 

1/42,000 


68 72 
1/50,000 


1/58,000 


Frameshift 
% of total 
ER (rER) 


46 

1/37,000 


60 

1/25,000 


21 22 
1/162,000 


1/188,000 


Strand Jump 
% of total 
ER (jER) 


8 

1/213,000 


5 

1/297,000 


11 6 

1/324,000 


1/690,000 



METHODS 



[0272] Mutagenesis. Using a standard site directed mutagenesis protocol, as 

described in Example 3, a primer containing the V223H mutation was 
annealed to single strand DNA of Superscript™ II with the following 
mutations: H204R, T306K, F309N. The colonies were sequenced to confirm 
the new combination of V223H, H204R, T306K, and F309N. 

[0273] Selecting Colonies Containing Active Reverse Transcriptase. Colony 

selection was performed as in Example 3. 

[0274] Purification of RT mutants. Purification was performed as in 

Example 3. 

[0275] Sequencing of plaques. The plaques from the lacL forward assay were 

transferred from the soft agar plate to Whatmann 3 MM paper and allowed to 
dry for at least 1 hour. The plaque was then punched out and the plaque/paper 
disk was added directly to a sequencing reaction mix containing 4-8 \x\ ABI 
PRISM Dye Terminator Cycle Sequencing Ready Reaction (Perkin Elmer), 1 
^il primer (GAAGATCGCACTCCAGCCAGC) (SEQ ID NO:8), and distilled 
water to 20 jal total volume. The ABI cycle sequencing protocol was used for 
96°C 10 seconds, 50°C 5 seconds, 60°C 4 minutes for 25 cycles. The paper 



- 124- 

disks were removed and the reactions were precipitated, then resuspended in 
loading dye and run on an ABI 377 sequencing machine. 

[0276] The sequences were compared to wild type lacZ alpha sequence and 

then classified as frameshift (either 1 nucleotide insertion or deletion), 
mismatch, or strand jump (an insertion or deletion between repeated 
sequences). The overall error rate for each class was determined by dividing 
the mutation frequency by the number of detectable sites (i.e., sites the 
alteration of which results in a phenotypic change) (116) multiplied by 0.5 (to 
exclude the original single strand contribution) and then multiplied by the 
percentage of mutants observed to be in each class. ER = MF/(detectable sites 
* 0.5) * (% in each class). 

[0277] 3dNTP assay method. 3dNTP assays were performed as in Example 3. 

RESULTS 

[0278] We carried out a misinsertion assay of F309N (H204R T306K) 

Superscript™ II reverse transcriptase, hereafter referred to as F309N, and 
V223H F309N (H204R T306K), hereafter referred to as V223H/F309N with 
DNA template. This assay was employed to compare the misincorporation 
capability of the mutant to Superscript™ II. The assay is a primer extension 
assay using synthetic DNA template-primer and biased dNTP pools containing 
only three of the four dNTPs. The reactions are displayed on a gel in Fig. 5 
and Fig. 6. In this assay, higher efficiency of primer extension denotes lower 
fidelity. As shown in Figs. 5 and 6, in the presence of all 4 dNTPs, 
Superscript™ II reverse transcriptase and the mutants F309N and 
V223H/F309N were able to extend the primer approximately equally, with 
some variance in the addition of non-template directed nucleotides at the end 
of the primer. However when incubated with biased pools of nucleotides, 
Superscript™ II reverse transcriptase was able to catalyze substantial 
extension past template nucleotides for which a complementary dNTP was 
missing, indicating use of incorrect nucleotides and lower fidelity. In Fig. 5, 
the F309N (2) mutant showed shorter extension products than Superscript™ 
II reverse transcriptase in each of the biased pools of three dNTPs, indicating 
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less ability to incorporate incorrect nucleotides and thus higher fidelity. In 
Fig. 6, the V223H/F309N mutant was extended with just the dATP and dCTP 
pools. In each case V223H/F309N also had lower extension products than 
Superscript™ II. This corresponds with the results of the lacZa assay 
where the F309N and V223H/F309N mutants had a lower mutation frequency 
than Superscript™ II reverse transcriptase (HxlO" 4 and MxlO" 4 to 39X10* 4 ). 
The reverse transcriptase with just the H204R T306K mutations without 
F309N has a mutation frequency similar to SUPERSCRIPT™ II reverse 
transcriptase (41X10 -4 to 39xl0" 4 ), suggesting that these mutations do not 
influence fidelity. This data shows a correlation between the misinsertion 
assay on DNA and the lacZa assay on RNA wherein higher fidelity mutants 
had both shorter extension products with biased pools of dNTPs and lower 
mutation frequencies in the lacZa assay. 

Example 5: Error Rate Determination 

To determine Error Rates, mutant plaques from the lacL forward assay 
were sequenced using known methods. The mutations were then classified 
into one of the following categories: mismatches for misinsertion events, 
frameshifts for single insertion or deletion events, or jumps for large insertions 
or deletions caused by jumping between similar sequences. An overall Error 
Rate was then determined for nucleic acid encoding the lacL alpha peptide 
using the following equation: 

ER (error rate) = MF (mutation frequency) / (number of detectable sites x 0.5), 
where the number of detectable sites is 1 16. 

Not all bases mutated in lacZ forward assays result in a detectable 
phenotypic change. To determine specific error rates for mismatch, frameshift 
and jumps, the mutation frequency was modified by multiplying by the 
percent of the total of each mutant category, and then used to determine the 
specific error rate. The following is a sequence map of the lacZa peptide in 
M13mpl9 from SUPERSCRIPT™ II reverse transcriptase and the high fidelity 
Superscript™ II H203R T306K F309N reverse transcriptase assays. 
Underlining indicates deletions; " A " indicates insertions of the base A, T, C, or 
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G shown above; A, T, C, or G shown above the complete sequence indicates 
mismatches. 

Map of Mutations Introduced by Superscript™ II 

T C 

T T TC C 

AGCGCAACGC AATTAATGTG AGTTAGCTCA CTCATTAGGC ACCCCAGGCT TTACACTTTA 
1 1 4 

CG C CC 

TGCTTCCGGC TCGTATGTTG TGTGGAATTG TGAGCGGATA ACAATTTCAC ACAGGAAACA 
1 

C 

CC CG C 

GCTATG ACC ATG ATT ACG^CCA AGC TTG CAT GCC TGC AGG TCG ACT CTA GAG GAT CCC CGG 

1 

T AAAA 
T A AAA 
T T A A 

T T T A T AC 

GTA CCG AGC TCG AAT TCA CTG GCC GTC GTT^TTA CAA CGT CGT GAC TGG GAA AAC CCT GGC 

7 111 

TTTTT 

C TTTTT 
C TTT 
ATT 

TC C C TTCTCTTCG T 

GTT ACC CAA CTT AAT CGC CTT GCA GCA CAT CCC"CCT"TTC"GCC AGC TGG CGT 

1 4 



AAT AGC G (SEQ ID NO: 9) 



Table 7 



Insertions 


40 


38% 


60% frameshift 
(insertion or 
deletion) 


Deletions 


23 


22% 




Mismatches 


36 


35% 


35% mismatch 


Jumps 


5 


5% 


5% Jumps 



Table 8 



Overall Error Rate (oER) 


1/15,000 


(SQxio-XnexO.s) 


Mismatch Error Rate (mER) 


1/42,500 


(0.35 x39xl0-")/(116x 0.5) 


Frameshift Error Rate (fER) 


1/25,000 


(0,60 x39xl0->(l 16x0.5) 


Jumps Error Rate (jER) 


1/297,000 


(0.05 x39xlO^)/(H6x 0.5) 
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Example 6: Analysis of Enzymatic Activity in Mutant RTs 



[0281] Reverse transcriptase (RT) (e.g., retroviral RT) is one of the most 

intensely studied DNA polymerases, both because of its use as an essential 
tool for the synthesis and cloning of cDNA and because of its importance as a 
target for inhibition of HIV (Skalka, A.M. and Goff, S.P., Reverse 
Transcriptase, Cold Spring Harbor Laboratory Press, Cold Spring, NY 
(1993)). A dichotomy exists however in that HIV RT, the focus of much of 
recent study (Le Grice, S.F.J, in Reverse Transcriptase (Skalka, A.M. and 
Goff, S.P., eds) Cold Spring Harbor Laboratory Press, Cold Spring, NY, pp. 
163-191, (1993)), has not been used widely as a tool for cDNA synthesis. 
This is because HIV RT has a relatively high error rate (Bebenek, K. and 
Kunkel, T.A. in Reverse Transcriptase (Skalka, A.M. and Goff, S.P., eds) 
Cold Spring Harbor Laboratory Press, Cold Spring, NY, pp. 85-102 (1993), 
and because it does not synthesize efficiently full-length copies of long 
mRNAs in vitro. Other forms of retroviral RT used widely to synthesize 
cDNA, M-MLV RT and AMV RT, also suffer from these limitations, but to a 
lesser extent (Bebenek, K. and Kunkel, T.A. in Reverse Transcriptase (Skalka, 
A.M. and Goff, S.P., eds) Cold Spring Harbor Laboratory Press, Cold Spring, 
NY, pp. 85-102 (1993); Krug, M.S. and Berger, S.L. Methods in Enzymol. 
752:316-325 (1987); Gerard, G.F. and D'Alessio, J.M. (1993) in Methods in 
Molecular Biology, Vol 16: Enzymes of Molecular Biology (Burrell, M.M., ed) 
pp. 73-93, Humana Press, Totowa, NJ (1993); Gerard, G.F., et al, Molecular 
Biotechnology 5:61-77 (1997)). In addition to polymerase activity, retroviral 
RT possesses RNase H activity that degrades the RNA in an RNA-DNA 
hybrid (Moelling, K., et al, Nature New Biology 234:240-244 (1971)). The 
presence of this degradative activity is responsible in part for the limitation on 
efficient synthesis of long cDNA (Krug, M.S. and Berger, S.L. Methods in 
Enzymol. 752:316-325 (1987), Berger, S.L., et al, Biochem. 22:2365-2372 
(1983)). The RNase H domain of RT can be mutated to reduce or eliminate 
RNase H activity while maintaining mRNA-directed DNA polymerase activity 
(Kotewicz, M.L., etal, Nuc. Acids Res. 76:265-277 (1988), DeStefano, J.J., et 
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al, Biochim. Biophys. Acta /27P:380-388 (1994)), improving the efficiency of 
cDNA synthesis (Kotewicz, M.L., et al, Nuc. Acids Res. 7(5:265-277 (1988)). 
| A second significant drawback to copying mRNA is the tendency of 

RT to pause during cDNA synthesis resulting in the generation of truncated 
products (Harrison, G.P., et al, Nuc. Acids Res. 26:3433-3442 (1998), 
DeStefano, J.J., et al, J. Biol. Chem. 266:7423-7431 (1991)). This pausing is 
due in part to the secondary structure of RNA (Harrison, G.P., et al, Nuc. 
Acids Res. 2(5:3433-3442 (1998), Wu, W., et al, J. Virol 70:7132-7142 
(1996)). Performing cDNA synthesis at reaction temperatures that melt the 
secondary structure of mRNA helps to alleviate this problem (Myers, T.W. 
and Gelfand, D.H., Biochem. 50:7661-7666 (1991)). In addition, the 
oligo(dT) n primer often used to initiate cDNA synthesis tends to prime at 
internal stretches of A residues in mRNA at lower temperatures, resulting in 
the synthesis of 3 '-end truncated cDNA products. M-MLV RT does not 
efficiently synthesize cDNA from mRNA above 43°C (Tosh, C, et al, Acta 
Virol 47:153-155 (1997)). RNase H-minus (H-) M-MLV RT can be used up 
to 48 °C because in the absence of RNase H activity the mRNA template- 
DNA product complex is maintained during cDNA synthesis in a structural 
form that protects RT from thermal inactivation. 

In an effort to raise the temperature at which M-MLV RT can be used 
to synthesize cDNA, we have randomly mutagenized the H- M-MLV RT gene 
and screened for thermal stable mutants. Several thermal stable mutants of H- 
M-MLV RT were identified and purified enzymes were characterized. We 
show that when the mutations are present together they increase RT thermal 
activity by increasing its intrinsic thermal stability without altering catalytic 
activity. 

EXPERIMENTAL PROCEDURES 

Bacterial Strains and Plasmids - E. coli DH10B (Invitrogen) was used 
in all experiments. Bacterial liquid cultures were grown at 37 °C in EG: 2% 
tryptone, 1% yeast extract, 0.5% glycerol, 10 mM NaCl, and 1 mM KC1. 
Solid medium was LB (1% bactotryptone, 0.5% yeast extract, and 86 mM 
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NaCl) + 1.5% agar. Selective media included 100 (ag/ml ampicillin. For 
induction of cultures, cells were inoculated into selective EG + 0.2% arabinose 
and grown for 18 hr. 

[0285] RNA and DNA - Chloramphenicol acetyl transferase (CAT) cRNA 

(-900 nt) with a 40-nucleotide poly(A) tail at the 3 '-end was synthesized by 
T7 RNA polymerase run-off transcription from linearized plasmid DNA 
(D'Alessio, J.M. and Gerard, G.F., Nuc. Acids Res. 75:1999-2014 (1988)). 
The cRNA was selected on oligo(dT)-cellulose to ensure the presence of a 
poly(A) tail For labeling, the 5' end of CAT cRNA was dephosphorylated 
with alkaline phosphatase. (rC) n , p(dT)i2-i8, and p(dT)25-30 were purchased 
from Amersham Pharmacia. (rA)63o was purchased from Miles. (dG)i5 and 
(dT) 2 o were from Invitrogen. A DNA 24-mer complementary to CAT cRNA 
that annealed between nucleotides 679 and 692 with its 5' end 146 nucleotides 
distant from the first base at the 5' end of the CAT cRNA poly(A) tail was 
from Invitrogen. Oligonucleotides to prime PCR and perform site-directed 
mutagenesis were from Invitrogen. 

[0286] M-MLV RT Gene - The M-MLV RT gene used in these studies was 

derived from pRT601 (Kotewicz, M.L., et al, Nuc. Acids Res. 75:265-277 
(1988), Gerard, G.F., et al., DNA 5:271-279 (1986)). The gene starting at 
nucleotide 2598 and ending at nucleotide 4628 (Shinnick, T.M., et al, Nature 
293:543-548 (1981)) was cloned under control of an araD promoter into 
plasmid pBAD/HisA (Invitrogen). The M-MLV gene was further modified by 
site-directed mutagenesis without changing amino acid coding to include 
several unique restriction endonuclease sites that divided the gene into five 
segments (Fig. 2A). The amino end of the protein contained a His 6 tag to 
simplify purification that included the following amino acids: 
MGGSHHHHHHGMASMTGGQQMGRDLYDDDDKH (amino acids 1-32 
of SEQ ID NO:2). The carboxy end of the protein contained the additional 
amino acids NSRLIN present as the result of subcloning from pRT601 (17). 
In addition, the M-MLV RT gene was mutated (D524G, E562Q, D583N) to 
eliminate RNase H activity. The final construct was designated pBAD-HSS2 
(Fig. 2B), and the gene and gene product were designated Hise H- RT, 
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[0287] DNA Sequencing - Plasmid DNA was prepared from an over night E. 

coli culture bearing an RT mutant using a Concert High Purity Miniprep Kit 
(Invitrogen) following the manufacturer's instructions. Each DNA was 
sequenced using a forward and reverse primer bordering the segment that had 
been mutagenized to generate the mutant. The sequencing reactions were 
carried out as specified for plasmid DNA using the ABI Big Dye Terminator 
Sequencing Ready Reaction Kit. The reactions were analyzed using an ABI 
PRISM 377 DNA Sequencer. 

[0288] Random Mutagenesis - Mutagenic PCR was performed in a reaction 

mixture (100 jul) containing 20 mM Tris-HCl (pH 8.4), 50 mM KC1, 1.8 mM 
MgCl 2 , 0.3 mM MnS0 4 , 200 \xM each of dCTP, dGTP, dTTP, and dATP, and 
0.5 units of Taq DNA polymerase (Invitrogen Corporation, Carlsbad, CA). 
After a 1 min denaturation step at 94 °C, the cycling protocol was 15 sec at 94 
°C, 15 sec at 55 °C, and 30 sec at 72 °C for 20 cycles. Amplification was 100 
fold from 50 ng of target to 5 \xg of amplified product. PCR primers included 
appropriate restriction endonuclease cut sites. An amplified DNA segment 
was cleaved with appropriate restriction endonucleases, gel purified, and 
cloned into gel purified vector DNA cut with the corresponding restriction 
enzymes. 

[0289] Site-directed Mutagenesis - Oligonucleotide site-directed mutagenesis 

was carried out by established procedures (Kunkel, T.A., et aL, Methods 
Enzymol 754:812-819 (1987), Kunkel, T.A., et aL, Methods Enzymol 
204:125-139 (1991)). Saturation of an amino acid coding site in the H- RT 
gene with all possible amino acids was performed by introducing the sequence 
NNK (N = A,C,G or T and K - G or T) at the codon site into the mutagenic 
oligonucleotide. 

[0290] DNA Polymerase Assays - During screening of RT mutants, RT RNA- 

directed DNA polymerase activity was assayed with (rC) n *(dG)i 5 , which is 
specific for RT under the reaction conditions used (Gerard, G.F., et aL, 
Biochem. 75:1632-1641 (1974)). Reaction mixtures (50 jal) containing 50 mM 
Tris-HCl (pH 8.4), 50 mM KC1, 10 mM MgCl 2 , 300 ^iM (rC) n , 120 ^iM 
(dG)i5, 0.01% (v/v) Triton X-100, and 100 ^M [a- 32 P]dGTP (1,000 
cpm/pmole), were incubated at 37 °C for 10 min in the wells of a 96-well 
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plate. An aliquot (5 \xl) from each well was spotted onto a Genescreen+ 
(NEN) filter and the filter was washed twice for 10 min with 4% (w/v) sodium 
pyrophosphate (pH 8.0). Radioactivity bound to the dried filter was quantified 
in a phosphorimager (Molecular Dynamics). 

[0291] RT DNA polymerase unit activity was assayed with (rA) 6 3o*p(dT)i 2 .i8 

(Houts, G.E., et al t J. Virol 29:5 17-522 (1979)). One unit of DNA 
polymerase activity is the amount of RT that incorporates one nmole of 
deoxynucleoside triphosphate into acid insoluble product at 37 °C in 10 min. 

[0292] cDNA synthesis from CAT cRNA was carried out in reaction mixtures 

(20 |al) containing 50 mM Tris-HCl (pH 8.4), 75 mM KC1, 3 mM MgCl 2 , 10 
mM dithiotreitol (DTT), 500 jiM'each of dATP, dTTP, dGTP, and [a- 
32 P]dCTP (300 cpm/pmole), 1,750 units/ml RNase Inhibitor, 130 |ag/ml (465 
nM) CAT cRNA, 20 ^ig/ml (2,300 nM) p(dT) 25 -30, and 3,250 units/ml (100 
nM) RT. Incubation was at various temperatures for 60 min in individual 
tubes. An aliquot of the reaction mixture was precipitated with TCA to 
determine yield of cDNA synthesized, and the remaining cDNA product was 
size fractionated on an alkaline 1.2% agarose gel (McDonell, M.W., et al, J. 
Mol Biol 11 0:199-146 (1977)). 

[0293] To establish mono- and divalent metal reaction optima, initial reaction 

rates were determined under conditions of limiting RT concentration during a 
10-min incubation at 37 or 50 °C. Reaction mixtures (20 \\X) contained 50 mM 
Tris-HCl (pH 8.4), 10 mM DTT, 500 jiM each of dTTP, dATP, dCTP, and 
[ 3 H]dGTP (100 cpm/pmole), 10 pmoles (2.8 |ig) CAT cRNA, 50 pmoles DNA 
24-mer, 0.5 pmoles RT, and KC1 and MgCl 2 , varied in concentration one at a 
time. 

[0294] Steady-state Kinetic Measurements - The steady-state kinetic 

parameters K m ( dT n>) and £ cat were determined as described (Polesky, A.H., et 
al, J. Biol Chem. 255:14579-14591 (1990)), using (A) n #(dT) 30 . A range of 
five [ 32 P]dTTP concentrations, which bracketed the AT m (dTTP) value, was used 
for each determination of the kinetic parameters. The concentration of the 
template-primer and enzyme in the reaction were 2 \iM (in primer termini) and 
4 |nM, respectively. Reaction mixtures (50 jal) also contained 50 mM Tris- 
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HC1, pH 8.4, 75 mM KC1, 3 mM MgCl 2 , and 10 mM DTT and were incubated 
at 37 °C. 

[0295] Mutant Screening - Mutant populations were plated on selective agar. 

Individual colonies were inoculated into 120 jal of selective EG in 96 well 
plates and grown overnight at 37 °C. The cell density of the cultures was ~10 9 
cfu/ml. An aliquot of the culture from each well was transferred into a new 
96-well plate containing 120 jal per well of selective media + 0.2% arabinose. 
This plate was incubated at 37 °C for 20 hr to induce expression of RT. An 
aliquot (5 jal) of culture from each well was then transferred into another 96- 
well plate containing 5 per well of 2XPLD (50 mM Tris-HCl, pH 8.0, 20 
mM EDTA, 1.8% (w/v) sucrose, 1% (v/v) Triton X-100, 10 mM DTT, and 2 
mg/ml lysozyme). After heating for 5 to 10 min at various temperatures (52 to 
58 °C) in a thermocycler and cooling to room temperature, RT activity in a 5- 
jil aliquot from the lysate in each well was assayed with (rC) n *(dG)i 5 in 
another 96-well plate as described above. The amount of RT activity in a 5-|al 
aliquot of extract was within the linear range of the assay. Lysates were stable 
at room temperature for at least 1 hr. 

[0296] Purification of RTs - All operations were at 4 °C. Induced E. coli cells 

(5 g) bearing pBAD-HSS2 or a derivative were suspended in 10 mL of buffer 
(20 mM Tris-HCl, pH 8.0, 100 mM KC1, and 1 mM PMSF) and were 
sonicated for disruption. After clarification by centrifugation at 20,000 x g for 
30 min, RT was purified by sequential chromatography on a 5-ml Chelating 
Sepharose column charged with Ni 2+ and a Mono S HR 5/5 column 
(Amersham Pharmacia). In some cases the RT was fractionated on a third 
column (AF-Heparin-650 from TosoHaas) to eliminate traces of RNase 
contamination. The purified RT was dialyzed against storage buffer (40 mM 
Tris-HCl, pH 8.0, 100 mM KC1, 0.01% (v/v) Triton X-100, 0.1 mM EDTA, 1 
mM DTT, and 50% v/v glycerol) and stored at -20 °C. RT purified by this 
procedure was >90% homogeneous as judged by SDS PAGE and was free of 
detectable contaminating DNA endonuclease, DNA exonuclease, and RNA 
endonuclease. 
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[0297] Thermal Inactivation Profile of RT in Extracts - Cell lysates prepared 

in PLD as described already were heated in a 96-well plate in a thermocycler 
in which a temperature gradient exists through the rows, but the temperature in 
each column is the same. After incubation for 5 min at temperatures ranging 
from 25 °C to 56 °C, RT activity was assayed with (rC) n »(dG)i 5 as described 
already. 

[0298] Half Life Determination - Mixtures (20 |al) were incubated for various 

times in 0.5-ml tubes in a thermocycler at 50 or 55 °C and contained 50 mM 
Tris-HCl (pH 8.4), 75 mM KC1, 3 mM MgCl 2 , 10 mM DTT, 0.1% (v/v) Triton 
X-100, and 3-7 jig/ml purified RT. Incubation was stopped by placing the 
tube in ice. An aliquot (5 jxl) was assayed for residual activity with 
(rA) 63 o*p(dT)i2-i 8 . 

[0299] Measurement of K D by Filter Binding - A nitrocellulose filter-binding 

assay (Bailey, J.M., Anal Biochem. P5:204-206 (1979), Strauss, H.S., et al, 
Gene 75:75-87 (1981)) was used to determine the nucleic acid binding 
constants (K D ) of RTs for CAT cRNA«(dT) 20 . Dephosphorylated CAT cRNA 
was labeled at the 5' end with [y- 32 P]ATP and T4 polynucleotide kinase 
(Boehringer). Oligo(dT) 20 was annealed to the poly(A)-tailed CAT cRNA in a 
buffer containing 10 mM Tris-HCl, pH 7.5, and 80 mM KC1 at 65 °C for 5 
min followed by chilling on ice. Reaction mixtures (100 jal) containing 
binding buffer (50 mM Tris-HCl, pH 8.4, 75 mM KC1, 3 mM MgCl 2 , and 10 
mM DTT), 0.05 nM 32 P-labeled CAT cRNA, 1 nM (dT) 20 , and 1 to 50 nM RT 
were incubated at 23 °C for 5 min. After incubation, the mixture was filtered 
through a nitrocelullose filter (Millipore, HA 0.45 mm) soaked in binding 
buffer, which was then washed with binding buffer. The K D is equal to that 
enzyme concentration at which 50% of the labeled CAT cRNA is bound. For 
this method of analysis to be valid, the CAT cRNA concentration in the 
reaction must be substantially below AT D , so that the total enzyme 
concentration approximates the concentration of free unbound enzyme. 
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RESULTS 

Mutants Generated by Random Mutagenesis - PCR primers were 
designed to amplify each of five segments of the M-MLV His6 H- RT gene 
(Fig. 2A) and to contain appropriate restriction endonuclease cut sites. Each 
segment was randomly mutagenized by PCR mutagenesis (Experimental 
Procedures), digested with the appropriate restriction endonucleases, and 
cloned into a similarly digested pBAD-HSS2 plasmid in such a way as to 
replace the corresponding segment in the H- RT gene with the mutagenized 
segment (Fig. 2B). Libraries were sampled by DNA sequencing of the 
mutagenized H- RT gene segment of a small number of clones to determine 
the mutation frequency. The goal was to achieve rates of PCR random 
mutagenesis that produced 1 to 2 mutations per segment. We found that 
random mutagenesis that produced greater than 2 mutations per segment 
tended to produce a large proportion of inactive RT mutants. If the library met 
this criterion, further screening by heat treatment was carried out to identify 
mutants that showed greater RT activity in lysates than H- RT after a heat 
treatment step (Experimental Procedures; Fig. 7A). Those mutants with the 
highest apparent thermal stability were screened again in duplicate by 
pretreatment at 24 °C (to normalize for activity) and at 52 to 58°C to confirm 
the presence of enhanced thermal stability (Fig. 7B). In all, about 15,000 
clones were screened by heat treatment in the 96-well format for each segment 
or a total of about 100,000 mutants. 

Two mutants were found that produced RT with greater thermal 
stability than H- RT. DNA sequencing of their genes revealed them to be 
H204R in segment 2 and T306R in segment 3. Having identified two amino 
acid locations that influence RT thermal stability, we wished to establish if the 
particular mutant amino acid selected gave the greatest possible increase in 
thermal stability. The two sites in the H- RT gene were independently 
subjected to site-directed mutagenesis in which the sequence NNK was 
substituted for the existing codon in the mutagenic oligonucleotide 
(Experimental Procedures). Eighty-four members from each of these libraries 
were screened for thermal stable mutants. The results showed that T306K and 
T306R were the most thermal stable variants at T306, with T306K being 
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somewhat more thermal stable, and H204R was the most thermal stable 
variant at H204. 

[0302] The mutations were combined in one clone by sequential site-directed 

mutagenesis. Assay of heat-treated extracts showed that the double mutant 
His 6 H- H204R T306K RT was more thermal stable than either mutant alone 
and much more thermal stable than His 6 H- RT (Fig. 8). 

[0303] Utilizing plasmid pBAD-HSS2 that contained the H204R and T306K 

mutations as a starting point, RT gene segments 1 through 5 were again 
randomly mutagenized and libraries were screened for additional mutations 
that rendered His 6 H- H204R T306K RT more thermal stable. One additional 
mutation, M289L, in segment- 3 was identified that increased the thermal 
stability of the RT double mutant in crude extracts. 

[0304] Mutants from Single-Site Mutagenesis - His 6 H- T306K RT was 

mutated at position F309 to F309N by site-directed mutagenesis as part of a 
study of the fidelity of H- RT. The F309N mutation increased the thermal 
stability of His 6 H- T306K RT in crude extracts. 

[0305] Thermal Stability of Purified Mutant RTs - A number of RT mutants 

were purified to near homogeneity (Experimental Procedures) in order to 
establish their intrinsic thermal stability in the absence of contaminants and to 
characterize their enzymatic properties. The intrinsic thermal stability of 
purified forms of single mutants of His 6 H- RT at each of the 4 positions were 
established and compared with that of the starting enzyme. Mutations T306K, 
M289L, or F309N increased the half-life of His 6 H- RT a small amount at 50 
°C from 8 min to between 10 and 13 min (Table 9). 
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Table 9 

Half-lives of purified RT mutants at 50 °C 



Enzyme 


Half-life at 50 °C (min) a 


H- RT (Superscript™ II) 


3.2 ± 0.2 


His 6 H- RT 


8± 1.2(1.6±0.3) b 




His 6 H- H204R RT 


43 ±9 


His 6 H- T306K RT 


10 ±0.4 


His 6 H- F309N RT 


13+3 [ 


His 6 H- M289L RT 


13 ± 1 


His 6 H- H204R T306K RT 


78 ±8 




H- H204R T306K F309N RT 


30 


His 6 H- H204R T306K F309N RT 


105 ± 11 




Hxs 6 H- H204R M289L T306K F309N RT 240 ± 10 (8. 1 + 0 3) 



'Mean ± standard deviation of two or three determinations 



b Half-lives at 55°C are in parentheses 



[0306] The single mutation with the greatest impact was H204R, which 

increased the half-life of RT at 50 °C 5-fold (Table 9). However, combining 
the mutations had at a minimum a positive additive effect upon thermal 
stability. Combining H204R and T306K increased the half-life at 50 °C 10- 
fold, and combining M289L and F309N with these two mutations increased 
the half-life at 50 °C another 3-fold to 240 min (a 30-fold increase relative to 
the half-life of the starting enzyme; Table 9). 

[0307] The half-lives of His 6 H- RT and all its mutants dropped off at 55 °C 

relative to 50 °C (Table 9). The magnitude of the relative increase in intrinsic 
half-life observed at 50 °C for the quadruple mutant was maintained partially 
at 55 °C (Table 9). The half-life of His 6 H- RT of 1.6 min at 55 °C was 
increased 5-fold by H204R M289L T306K F309N to 8.1 min (Table 9). 

[0308] The presence of an N-terminal tag was seen to increase thermal 

stability of the RTs examined. A comparison of an RT with no N-terminal tag 
(Superscript™ II) with an RT having the same point mutations and having 
an N-terminal tag with the sequence 

MGGSHHHHHHGMASMTGGQQMGRDLYDDDDKH corresponding to 
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amino acids 1-32 of SEQ ED NO:2 (His 6 H- RT) shows that the presence of 
the tag increases the thermal stability of the RT. The presence of the tag 
increased the half-life of the RT at 50 °C by a factor of 2.5-fold (from 3.2 
minutes to 8 minutes). A comparison of the triple mutant H204R T306K 
F309N with and without tag showed an increase in the half-life of the enzyme 
at 50 °C by a factor of 3.5-fold (30 minutes to 105 minutes). Thus, the present 
invention contemplates RTs with an increased thermal stability that comprise 
one or more amino acids added to the N-terminal of the RT. In addition, the 
invention contemplates RTs with enhanced thermal stability that comprise one 
or more amino acids added to the C-terminal of the RT. 
[0309] Enzymatic Characterization of Purified Mutant RTs - A number of 

catalytic properties of the purified RT mutants were compared to ascertain if 
introduction of all four point mutations in RT altered its DNA synthetic 
activity. With one exception, all of the thermal stable mutants characterized 
had RNA-directed DNA polymerase specific activities greater than that of 
starting His 6 H- RT, but within a factor of 1.5-fold (Table 10). 



Table 10 

RNA-directed DNA polymerase specific activity of purified RT mutants 



Enzyme 


DNA Polymerase Specific Activity 
(units/jxg) 


His 6 H- RT 


281 (1.0) a 


His 6 H-H204RRT 


325(1.16) 


His 6 H- T306K RT . 


402(1.43) 


His 6 H- F309N RT 


257 (0.91) 


His 6 H- M289LRT 


339(1.21) 


His 6 H- H204R T306K RT 


385 0.37) 


His 6 H- H204R T306K F309N RT 


391(1.39) | 


His 6 H- H204R M289L T306K F309N RT 


410(1.46) 



'Ratio of specific activity setting that of His 6 H- RT at 1.0. 



[0310] His 6 H- F309N RT had a slightly reduced DNA polymerase specific 

activity (Table 10). The catalytic efficiency (KJK^ of His 6 H- RT and His 6 
H- H204R M289L T306K F309N were within a factor of two of each other 
and their Kms for nucleotide substrate were similar (Table 1 1). 
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Table 11 

Catalytic constants of purified RT mutants' 



Enzyme 


K m (nM) 


ka* (sec" 1 ) 


k ca t/K m 


His 6 H- RT 


390 ± 98 


45 + 18 


0.115 


His 6 H- H204R M289L T306K F309N RT 


274 ± 92 


46+ 11 


0.17 



a Mean ± standard deviation of three determinations 



[0311] The monovalent and divalent metal ion optima of Hiss H- RT and His6 

H- H204R M289L T306K F309N RT were determined. CAT cRNA*DNA 
24-mer in excess over enzyme was used as template-primer (Experimental 
Procedures). The optima were the same: 75 mM KC1 and 3 mM MgC^ at 37 
°C for His 6 H- RT and His 6 H- H204R M289L T306K F309N and at 50 °C for 
the quadruple point mutant. Taken together these results indicate that the four 
mutations identified that substantially increased the thermal stability of H- RT 
did not appreciably affect its DNA polymerase catalytic capability. 

[0312] In the presence of a template-primer, the half-life of M-MLV H- RT at 

50 °C is increased by a factor of 3 to 4 relative to the half-life in the absence of 
nucleic acids. This is the result of the enzyme being more resistant to heat 
inactivation when bound to template-primer than when unbound in solution. 
Mutations that impact the affinity of RT for template-primer will affect the 
apparent thermal stability of RT when it is engaged in cDNA synthesis. 
Therefore the affinities (Kv) of His 6 H- RT and several of its mutants for CAT 
cRNA#(dT) 2 o were determined (Table 12). 



Table 12 

Nucleic acid dissociation constants of purified RT mutants 



Enzyme 


K D (nM) a ' b 


His 6 H- RT 1 


8 ±0.5 


His 6 H- H204R RT 


35 ±0.5 


His 6 H- T306K RT 


6.6 ± 0.4 


His 6 H- F309N RT 


28 ± 2.5 


His 6 H- M289L RT 


4.7 ± 0.7 


His 6 H- H204R T306K RT 


6.1 ± 1.5 


His 6 H- H204R T306K F309N RT 


7.3 ± 2.4 


His 6 H- H204R M289L T306K F309N RT 


11 ± 1.0 



"Mean ± standard deviation of two or three determinations 
b The nucleic acid was CAT cRNA«(dT) 20 
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[0313] The K D of the His 6 H~ RT quadruple mutant was increased somewhat 

from 8 nM to 1 1 nM, indicating that nucleic acid binding affinity was reduced 
slightly by the four point mutations together. Interestingly two of the point 
mutations when present alone, H204R and F309N, reduced the binding 
affinity of His 6 H- RT substantially more, having a K D of 35 and 28, 
respectively. The effects of these mutations on binding were apparently 
counter balanced by mutation T306K (K D = 6.6 nM), since the triple mutant 
H204R T306K F309N had a K D of 7.3 nM. We conclude that when the triple 
(H204R T306K F309N) or quadruple mutant of His 6 H- RT is engaged in 
cDNA synthesis, any increase in apparent thermal stability observed relative 
to His6 H- RT is not due to increased protection by virtue of binding more 
tightly to template-primer. 

[0314] Practical Implication of Higher RT Thermal Stability - To assess the 

practical impact of the increases in thermal stability imparted by the H- RT 
mutations selected in this study, we measured the ability of some of the 
mutants to synthesize full-length CAT cDNA between 40 and 55 °C (Table 
13). 



Table 13 

Synthesis of full-length cDNA product from CAT cRNA by purified RT 
mutants at elevated temperatures 8 



Enzyme 


Amount of Full Length Product (ng) 
Synthesized at 




40°C 


45°C 


50°C 


52.5°C 


55°C 


His 6 H- RT 


517 


523 


229 


40 


2 


His 6 H- H204R RT 


521 


446 


146 


36 


17 


His 6 H- T306KRT 


456 


512 


338 


122 


28 


His 6 H- H204R T306K RT 


664 


528 


330 


202 


43 


His 6 H- H204R T306K F309N RT 


488 


556 


396 


209 


53 


His 6 H- H204R M289L T306K F309N RT 


540 


547 


539 


376 


120 















a cDNA synthesis reaction mixtures (as described above) contained 9.3 pmoles 
(2.6 jxg) of CAT cRNA. The amounts of full-length products were established 
by cutting the region corresponding to the size of each full-length band from a 
dried alkaline 1.2% agarose gel and counting it in a scintillation counter. 



V 



i 
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Reaction mixtures contained 2 pmoles of RT and were incubated at the 
temperatures indicated for 60 min. The results of a single experiment are 
shown. Similar results were obtained in one other experiment. 

[0315] The amount of RT relative to template-primer was limiting in these 

reactions. In the presence of limiting RT, failure to achieve full-length cDNA 
synthesis as the temperature is increased is an indicator of thermal inactivation 
of the enzyme under cDNA synthesis conditions. With one exception at 
temperatures above 50 °C, the amount of full-length CAT cDNA synthesized 
by a particular RT mutant (Table 13) correlated well with the thermal stability 
(half-life) of the enzyme at 50 °C (Table 9). The one exception is mutant 
H204R. It produced less full-length cDNA than His 6 H- RT at 50 and 52.5 °C, 
in spite of having a 5-fold greater half-life at 50 °C. This is probably due to 
the weaker binding of H204R than H- RT to CAT cRNA^DNA, diminishing 
thermal protection by template-primer. All the RTs tested maintained activity 
at 45 °C relative to 40 °C, but with the exception of the quadruple mutant, all 
RTs synthesized less full-length CAT cDNA at 50 °C and above. The His 6 H- 
RT quadruple mutant was folly active at 50 °C and at 55 °C synthesized 60 
times more full-length CAT cDNA than His 6 H- RT. This was 22% of the 
amount synthesized by both enzymes at 40 °C. 

[0316] The methods of the present invention can be used to engineer a 

polypeptide to have a desired characteristic, for example, a desired level of 
thermostability. The nucleic acid sequence of the polypeptide may be sub- 
divided into segments and the segments individually mutagenized, for 
example, using PCR random mutagenesis. The individual segments may then 
be re-assembled and used to express a mutated polypeptide that is screened for 
the desired activity. The Examples above provide working embodiments of 
the invention in which the rather large RT gene (2154 bases) was randomly 
mutated in segments by PCR random mutagenesis (Leung, et al, (1989) 
Technique 7, 1 1-15. Cadwell, et al. 9 (1992) PCR Methods and Applications 2, 
28-33). It may be desirable to sub-divide the sequence of the polypeptide of 
interest into fragments that correspond to functional or structural domains of 
the polypeptide of interest if such domains are known. In the working 
embodiments above, the segments roughly corresponded to the coding 
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sequences of the five separate structural subdomains of RT (Kohlstaedt, et al, 
(1992) Science 256, 1783-1789. Jacobo-Molina, et al, (1993) Proa Natl 
Acad. Set USA 90, 6320-6324). Segments one through four corresponded to 
the polymerase subdomain of fingers, palm, thumb, and connection, 
respectively, and segment five corresponded to the RNase H domain (Fig. 
2A). 

[0317] Using the materials and methods of the present invention, one skilled 

in the art can construct a modified RT enzyme having a desired level of 
thermostability. In the examples presented above, the operating temperature 
of H- M-MLV RT was increased to at least 10 °C above 45 °C. This target 
temperature was selected as a compromise: high enough to help reduce RNA 
secondary structure and low enough to avoid RNA breakdown. Although, the 
rate of chemical RNA breakdown catalyzed by Mg""" increases dramatically as 
the temperature is increased above 55 to 60 °C under cDNA synthesis reaction 
conditions, it may be desirable in certain instances (e.g., RNA with 
pronounced secondary structure) to conduct a reverse transcription reaction at 
a temperature. higher than 55 °C, for example, at about 60 °C, or at about 65 
°C, or at about 70 °C. Using the methods described above a suitable RT can 
be constructed using routine experimentation. 

[0318] Reaction conditions for the mutagenesis reaction may be adjusted to 

introduce the desired number of mutations in each segment. In the present 
study, an upper limit cut-off of 1 to 2 mutations per segment was set as the 
target for mutation frequency to suppress accumulation of deleterious 
mutations (Lehmann, et al., (2001) Current Opinion in Biotechnology 12, 371- 
375), and to minimize the amount of screening required to find active mutants. 
Those skilled in the art will appreciate that conditions may be adjusted in 
order to induce a greater mutation frequency, for example, by adjusting the 
concentration of Mn 2+ , pH, salt concentration etc. Mutation frequencies of > 5 
mutations/segment in segment one, two, or three produced only about 5% 
active mutants with all the mutants having less than wild-type activity. At the 
mutation frequency used of 1 to 2 mutations per segment, approximately one- 
third of the mutants had little or no activity, one-third had less than 50% of 
His 6 H- RT activity, and one-third had up to 100% of His 6 H- RT activity. 
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[0319] Polypeptides produced from the mutated sequences are screened for a 

desired activity. The polypeptides may be screened from crude extracts, 
partially purified extracts or from purified polypeptide preparations. In the 
Examples above, crude extracts of mutants were screened in a 96-well plate 
format with template-primer specific for RT activity (Gerard, et a/., (1974) 
Biochem. 13, 1632-1641). The procedure used to produce E, coli cell crude 
extracts permeabilized rather the lysed the cells, releasing proteins and small 
RNAs to the outside of the cell matrix, while keeping DNA inside. This made 
representative sampling and assay with an exogenous template-primer of the 
cell suspension feasible. The signal-to-noise ratio of the assay was such that 
to differentiate starting from mutant cells at least a two-fold difference in 
activity was required. Those skilled in the art will appreciate that other 
methods of screening, for example, by first purifying the mutated polypeptides 
of interest (e.g., RTs) will permit detection of smaller changes in activity level. 

[0320] Utilizing this approach, two mutations were found in segment three 

(thumb) and one in segment two (fingers). A rational sequence homology 
approach combined with site-directed mutagenesis was used to identify a 
fourth mutation located in the fingers subdomain. The small number of 
mutations found and the failure to identify mutations in segments one and four 
were probably due to limitations of both the random mutagenesis approach 
and the screen. The mutation frequency cut off of 1 to 2 mutations per 
segment imposed on the random mutagenesis limited the mutant space 
available in the mutant population. The inability of the assay to detect less 
than a two-fold difference in activity probably made many active and slightly 
more thermal stable mutants undetectable. Since the RNase H domain of 
wild-type M-MLV RT is already substantially more thermal stable than the 
polymerase domain (Verma, I. (1975) 7. Virol 75, 843-854), it was unlikely 
that stabilizing mutations introduced in segment five would be picked up in a 
screen for more thermal stable polymerase activity. 
[0321] In spite of the limited number of mutations found with the approach 

used, those that were identified had a significant positive impact on RT 
thermal stability. The thermal stability of His 6 H- M-MLV H204R M289L 
T306K F309N RT was increased 30 fold at 50 °C. The RT operating 
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temperature was increased 7 °C to 55 °C. Three of the four mutations alone 
increased the RT half-life 1.2 to 1.6-fold and the fourth mutation gave a 5-fold 
increase. A well established operating principle in protein stabilization is that 
the total stabilization obtained by multiple mutations is the sum of the effects 
of individual mutations, assuming each mutation exerts its effect independent 
of the other mutations (Wells, J.A. (1990) Biochem. 29, 8509-8517). The 
thermal stabilizing effects of the four mutations appear to be additive, but an 
additional contribution to increased thermal stability due to cooperative 
interaction between mutations cannot be excluded. 

[0322] Discussion of the mechanisms whereby each of these four mutations 

affect the thermal stability and template-primer binding affinity of M-MLV 
RT requires a 3 -dimensional structural framework. In the case of the mutation 
at H204 in the M-MLV RT palm subdomain, this is easily accomplished 
because the structure of a fingers-palm fragment of M-MLV RT is known 
(Georgiadis, et al 9 (1995) Structure 3, 879-892). For T306K and F309N 
located in the thumb, we must rely on 3-dimensional structural data taken 
from HIV RT (Kohlstaedt, et al, Jacobo-Molina, et al. 9 Ding, et al, (1998) J. 
Mol. Biol. 284, 1095-1 1 1 1, and Huang, et al, (1998) Science 282, 1669-1675). 
Fortunately in this region of the thumb subdomain of HIV RT (T253 to P272) 
and the corresponding region of M-MLV RT (T296 to P315), the amino acid 
sequence is reasonably well conserved. M-MLV RT can be predicted to have 
an oc-helix in this region that resembles the structure of HIV RT a-helix H 
(Kohlstaedt, et al 9 Jacobo-Molina, et al 9 Kneller, et al 9 (1990) J. Mol Biol. 
214 9 171-182, and Rost, B. (1996) Methods Enzymol 266, 525-539.) 

[0323] Together the four RT amino acid changes exerted their stabilizing 

effect without significantly altering the DNA synthesizing catalytic activity of 
the enzyme. However two mutations, H204R and F309N, did significantly 
reduce RT binding to template-primer (Table 12). M-MLV RT H204R is 
located in the middle of the long a-helix H (aH) on the backside of the palm 
subdomain, directly below 0-sheets 10-11 (010-011) (Fig. 9, Georgiadis, et 
al, supra). Beta-sheets 10-11 contain the M-MLV RT polymerase catalytic 
site (YVDD) at residues 222 to 225 on the inside of the palm that contact the 
template-primer (Georgiadis, et al 9 supra). Changes in H204R could directly 
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affect packing of amino acid residues near it in otH and P 10-0 11, thus 
influencing how piO-pl 1 interact with template-primer. Or its effect could be 
translated along the polypeptide backbone since the a-helix containing H204 
and the P-sheet containing Y222 are separated by a single short turn (Fig. 9). 
As discussed above based upon an amino acid homology comparison of M- 
MLV and HIV RT and from crystal structures of HIV RT (Kohlstaedt, et al, 
supra, Jacobo-Molina, et al, supra, Ding, et al, supra, Huang, et al, supra, 
Kneller, et al, supra, Rost, Bebenek, et al, (1997) Nature Structural Biology 
4, 194-197, and Beard, et al, (1998) J. Biol Chem. 273, 30435-30442), F309 
in M-MLV RT can be predicted to be part of a minor groove binding track in 
an a-helix of the RT thumb subdomain. In HIV RT this structure contributes 
significantly to RT binding to template-primer (Bebenek, et al, supra). The 
aromatic phenylalanine at position 309 in M-MLV RT corresponds to 
tryptophan 266 in HIV RT (Bebenek, et al, supra). Extensive amino acid 
substitution studies of HIV RT W266 show that any substitution at this 
position reduces the binding affinity of RT for template-primer (Beard, et al, 
supra). We postulate that alteration of M-MLV RT F309 to asparagine 
impacts the structure of a putative M-MLV RT minor groove binding track 
resulting in a reduction in binding affinity for template-primer. Interestingly 
T306K was able to compensate for the reduced binding produced by H204R or 
H204R and F309N together, restoring template-primer binding affinity to the 
original level (Table 12). Again based upon amino acid homology in this 
region, T306 in M-MLV RT corresponds to a lysine in position 263 in the oc- 
helix of HIV RT that contains the minor groove binding track. The crystal 
structure of HIV RT complexed with double-stranded DNA shows that K263 
contacts the n-2 phosphate of the DNA primer (Ding, et al, supra, Huang, et 
al, supra). Introduction into M-MLV RT of a positively charged lysine in 
place of phenylalanine at position 306 compensates for the reduced binding of 
H204R and F309N, perhaps by enhancing RT binding at the n-2 primer 
phosphate position. 

[0324] H204R had the greatest individual thermal stabilizing effect on H- M- 

MLV RT. At position 204 histidine has been replaced with a more highly 
charged arginine in a-helix H at the underside of the palm subdomain 
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(Georgiadis, et al., supra). Examination of the 3 -dimensional structural model 
of the fingers-palm fragment of M-MLV RT (Georgiadis, et al, supra) reveals 
this substitution places arginine at a proper distance to form a salt bridge with 
E201 in a-helix H or a hydrogen bond with T128 in the loop between P-sheet 
6 and a-helix E (Fig. 9). Either event would help stabilize the RT molecule 
(Kumar, et al. 9 (2001) Cell. Mol Life Sci. 58, 1216-1233). 
[0325] The mutation at position 306 from threonine to lysine probably 

contributes to the thermal stability of M-MLV RT by making an electrostatic 
contribution to stabilizing the polypeptide backbone. Saturation of this site 
with all possible amino acids showed that only substitution of arginine or 
lysine increased M-MLV RT thermal stability. Both of these amino acids 
could allow for hydrogen bonding to main chain carbonyl groups and amino 
acid side chains in a loop adjacent to the putative a-helix containing residue 
306. 

[0326] Predictions about the mechanism whereby F309N increases M-MLV 

RT thermal stability are complicated by its proposed role in template-primer 
binding (see discussion above). M-MLV RT position 309 was also examined 
by saturation with all possible amino acid changes. Only serine and 
asparagine increased thermal stability. Substitution of charged amino acids 
for the exposed hydrophobic phenylalanine, which in other protein families 
correlates well with amino acid changes observed in transitioning from 
mesophilic to thermophilic family representatives (Kumar, et al., supra, 
Cambillau, et al., (2000) J. Biol. Chem. 275, 32383-32386), reduced or 
eliminated polymerase activity. Correlation of increased thermal stability with 
substitution of polar for exposed hydrophobic amino acids is observed much 
less frequently in protein families (Kumar, et al., supra, Cambillau, et al, 
supra), so that the structural basis for the stabilizing effect of F-»N at position 
309 is not clear. 

[0327] In the absence of both 3-dimensional structural information and 

reasonable sequence homology between HIV RT and M-MLV RT in the 
region of M-MLV RT M289, it is difficult to predict the structural basis for 
the contribution of M289L to M-MLV RT thermal stability. As a general 
tendency, thermally stable proteins have more and larger hydrophobic amino 
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acids than their mesophilic counterparts that are better able to exclude water 
from the protein core (Cambillau, et al. 9 supra). It is possible that M~»L at 
position 289 increases M-MLV RT thermal stability through allowing better 
hydrophobic packing by replacing an unbranched methionine with a P- 
branched leucine (Cambillau, et aL, supra). 
[0328] Consistent with the results of other studies focused on identifying 

mutations that increase protein thermal stability (Arnold, et al 9 (2001) Trends 
in Biochemical Sciences 26, 100-106), the mutations identified in this study 
probably reside on the RT surface and influence thermal stability without 
decreasing catalytic activity. In the case of two of the stabilizing amino acids, 
T306 and F309, they probably reside on a protein surface that contacts 
template-primer and they bind in such a way that their effects on binding are 
off setting. 

[0329] The present invention provides materials and methods for engineering 

a polypeptide to contain a desired characteristic. As an example, the M-MLV 
RT gene was mutated to increase the thermostability of the enzyme. The goal 
of this research was to increase the temperature at which M-MLV RT 
efficiently catalyzes cDNA synthesis from 48 °C to about 55 °C to about 60 
°C by increasing the intrinsic thermal stability of RT. The ability to use 
retroviral RT at 55 to 60 °C increases both the efficiency and specificity of 
priming of cDNA synthesis. With the mutagenesis and screening approaches 
used, mutations were identified that increased the M-MLV RT operating 
temperature limit to at least about 55 °C. These mutations created a dramatic 
increase in intrinsic thermal stability measured as a half-life increase at 50 °C 
(30-fold), that was maintained partially at 55°C (5-fold increase in half-life). 
There are alternative approaches that could potentially build on the four 
mutants already identified. For example, the target mutation frequency could 
be set much higher (5-10 mutations/segment) during random mutagenesis of 
several larger segments of the H- M-MLV RT quadruple mutant polymerase 
domain, giving a much smaller proportion of active mutants but a much 
broader mutant spectrum. Several libraries generated in this fashion could 
then be shuffled by recombination (Stemmer, W.P.C. (1994) Nature 370 9 389- 
391), which would tend to eliminate deleterious mutations (Lehmann, et al. 9 
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supra). When combined with a 96-well DNA polymerase activity screen 
made more sensitive and a sampling procedure used to assay multiple mutants 
per well to increase the number of mutations screened, such an approach 
should yield additional mutations that further increase the thermal stability of 
M-MLV RT. 

EXAMPLE 7 

Engineering of Superscript™ HI RT, Thermal Stability, cDNA synthesis. 

The gene for Moloney murine leukemia virus (M-MLV) RNase H- 
minus (H-) reverse trenscriptase (RT) 5 also known as Superscript™ II was 
randomly mutagenized and mutants were screened for increased thermal 
stability. Four mutations were identified that together increased the half-life 
of M-MLV H- RT at 50°C 35-fold and raised the RT cDNA synthesis 
operating temperature from 42°C to 55°C. This increase in thermal stability 
was achieved by increasing the intrinsic thermal stability of M-MLV H- RT 
and without diminishing the DNA polymerase catalytic activity of the enzyme. 

Reverse transcriptase (RT, e.g., retroviral RT) is an essential tool for 
the synthesis and cloning of cDNA. Forms of retroviral RT used widely to 
synthesize cDNA are M-MLV RT and AMV RT. In addition to polymerase 
activity, retroviral RT possesses RNase H activity that degrades the RNA in an 
RNA-DNA hybrid (Moelling, et al, (1971) Nature New Biology 234, 240- 
244). The presence of this degradative activity is responsible in part for the 
limitation on efficient synthesis of long cDNA (Krug and Berger (1987) 
Methods in Enzymol 152, 316-325; Berger, et al, (1983) Biochem. 22, 2365- 
2372). The RNase H domain of RT can be mutated to reduce or eliminate 
RNase H activity while maintaining mRNA-directed DNA polymerase activity 
(Kotewicz, et al, (1988) Nuc. Acids Res. 16, 265-277; DeStefano, et al, 
(1994) Biochim. Biophys. Acta 1219, 380-388), improving the efficiency of 
cDNA synthesis (Kotewicz, et al, supra). This has been done in 
Superscript™ II and ThermoScript. 

A second significant drawback to copying mRNA is the tendency of 
RT to pause during cDNA synthesis resulting in the generation of truncated 
products (Harrison, et al, (1998) Nuc. Acids Res. 26, 3433-3442; DeStefano, 
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et aL, (1991) J. Biol Chem. 266, 7423-7431). This pausing is due in part to 
the secondary structure of RNA (Harrison, et aL, supra, Wu, et aL, (1996) J. 
Virol 70, 7132-7142). Performing cDNA synthesis at reaction temperatures 
that melt the secondary structure of mRNA helps to alleviate this problem 
(Myers and Gelfand, (1991) Biochem. 30, 7661-7666). In addition, the 
oligo(dT)n primer often used to initiate cDNA synthesis tends to prime at 
internal stretches of A residues in mRNA at lower temperatures, resulting in 
the synthesis of 3 '-end truncated cDNA products. M-MLV RT does not 
efficiently synthesize cDNA from mRNA above 43 °C (Tosh, et aL, (1997) 
Acta Virol 41, 153-155). 

[0333] In an effort to raise the temperature at which Superscript™ II RT can 

be used to synthesize cDNA, we have randomly mutagenized the 
Superscript™ II RT gene and screened for thermal stable mutants. Several 
thermal stable mutants of Superscript™ II RT were identified and purified 
enzymes were characterized. We show that when the mutations are present 
together they increase RT thermal activity by increasing its intrinsic thermal 
stability without altering catalytic activity. 

[0334] Superscript™ III RT Purification: The Superscript™ III RT gene 

was derived from Superscript™ II RT. Four mutations, in addition to the 3 
RNase H mutations present in Superscript™ II, were included along with 
modifications to the N-terminus to increase the thermostability. The gene was 
cloned into plasmid pBAD (Invitrogen Corporation, Carlsbad, CA) under 
control of an araD promoter. The purified protein has an apparent molecular 
weight of 78 kDa. The plasmid was transformed into E. coli DH10B 
(Invitrogen Corporation, Carlsbad, CA) cells and a seed stock was grown up to 
an O.D. of 1.0 in EG (2% Tryptone, 1% yeast extract, 0.5% glycerol, 10 mM 
NaCl, and 1 mM KC1) and frozen in 60% S.O.B./40% glycerol at -80 °C. 

[0335] For purification, a stab from the frozen seed was used to inoculate a 

starter culture of 500 ml. The media used for the R&D prep was either EG or 
CircleGrow (Biol 01), supplemented with 100 |ig/ml Ampicillin. The starter 
culture was incubated overnight at 37°C and used to inoculate 10 liters EG + 
Amp media. The culture was incubated at 37°C to an OD 6 oo of 1.0 (3-4 
hours), induced with 0.2% arabinose, and incubation continued another 3 
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hours. Cells were pelleted by centrifugation at 5,000g for 20 minutes at 4°C. 
The supernatant was removed and cells were either frozen at -80°C or 
processed. 

[0336] Purification of SUPERSCRIPT™ III was done the same as 

Superscript™ II and is outlined as follows, with the exception of using an 
AF-Heparin 650M resin (TosoHaas) in place of Heparin Agarose as the third 
column. All buffers must be autoclaved and columns thoroughly cleaned to 
prevent RNase contamination. The procedure is done at 4°C unless otherwise 
indicated. 

[0337] Purification Method A. Thaw 50 grams of cells in 100 ml Buffer A (2 

ml per g of cell pellet: 20 mM Tris-HCL pH 7.5, 25 mM NaCl, 1 mM EDTA, 
1 mM DTT, 1.3 mM PMSF), stirring at 4°C until homogenous. Add another 
0.78 ml 0.1 M PMSF and mix well (5 minutes). Strain cells through 6 layers 
of cheesecloth. Filter the sample through gaulin pre-equilibrated with 1 
volume Buffer A (avoid bubbles and particulates in gaulin) at 4-6°C with 
pressure at 8000 psi. Repeat and check sample for > 80% lysis. Determine 
volume of cells and add 0.053 V of 5M NaCl to lysate. Stir at 4°C till mixed. 
Add 0.111 V of 5% polymin P (pH 7.9) over 30 minutes while mixing and 
then mix for another 30 minutes. Centrifuge at 18,100g for 60 min at 4°C, 
keep the supernatant. Add 226 g/L (NFL^SC^ to supernatant over 30 minutes 
while stirring and mix for another 30 minutes. Centrifuge at 18,100g for 60 
min at 4°C, keep the supernatant. Resuspend the pellet in Buffer B (20 mM 
Tris-HCL pH 7.5, 100 mM NaCl, 1 mM EDTA, 1 mM DTT, 0.01% NOG, 5% 
glycerol) at 0.52 ml/g original pellet weight (26 ml) while stirring for 1 hour. 

[0338] A G-25 F column (AP-Biotech, 270 ml resin volume) was equilibrated 

with Buffer B and then the sample was loaded. The column was washed and 
the sample eluted with 2 column volumes of Buffer B at a flow rate of 3.3 
ml/min and fraction volume of 6 ml/tube. Fractions were pooled based on UV 
absorbance and a clear amber color. If precipitation formed, the pool was 
centrifuged at 18,100g for 30 min at 4°C. 

[0339] The fraction pool was loaded onto a P-ll column (Whatman: 2x 

cycled, 25 ml resin volume) pre-equilibrated with Buffer B at a flow rate of 
1 .0 ml/min. The column was washed with 10 column volumes of Buffer C (20 
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mM Tris-HCL pH 7.5, 100 mM NaCl, 1 mM EDTA, 1 mM DTT, 0.01% NP- 
40, 5% glycerol) and eluted with 12 column volumes of a 100 to 500 mM 
NaCl gradient (Buffer C and 0 to 100% of Buffer D: 20 mM Tris-HCL pH 7.5, 
500 mM NaCl, 1 mM EDTA, ImM DTT, 0.01% NP-40, 5% glycerol) at a 
flow rate of 1.6 ml/min and fraction volume of 4 ml. Fractions were pooled 
by either OD 2 80 >50% peak height or >50% peak RT activity. The fraction 
pool was then diluted with one volume of Buffer E (20 mM Tris-HCL pH 7.5, 
1 mM EDTA, ImM DTT, 0.01% NP-40, 5% glycerol). 

[0340] The diluted pool was then loaded onto an AF-Heparin 650M column 

(TosoHaas, 12.5 ml resin volume) pre-equilibrated with Buffer F (20 mM 
Tris-HCL pH 7.5, 160 mM NaCl, 1 mM EDTA, ImM DTT, 0.01% NP-40, 
5% glycerol) at a flow rate of 0.7 ml/min. 

[0341] The column was washed with 10 column volumes of Buffer F (20 mM 

Tris-HCL pH 7.5, 160 mM NaCl, 1 mM EDTA, 1 mM DTT, 0.01% NP-40, 
5% glycerol) and eluted with 10 column volumes of a 160 to 700 mM NaCl 
gradient (Buffer F and 0 to 100% of Buffer G: 20 mM Tris-HCL pH 7.5, 700 
mM NaCl, 1 mM EDTA, ImM DTT, 0.01% NP-40, 5% glycerol) at a flow 
rate of 0.5 ml/min and fraction volume of 2 ml. Fractions were pooled by 
either OD 2 go >50% peak height or >50% peak RT activity. The fraction pool 
was then diluted with three volumes of Buffer H (20 mM Tris-HCL pH 7.5, 
100 mM NaCl 1 mM EDTA, ImM DTT, 0.03% NP-40, 5% glycerol). 

[0342] The diluted pool was then loaded onto an SP Sepharose HP column 

(Ap Biotech, 12.5 ml resin volume) pre-equilibrated with Buffer I (20 mM 
Tris-HCl pH 7.5, 100 mM NaCl, 1 mM EDTA, ImM DTT, 0.01% NP-40, 5% 
glycerol) at a flow rate of 1.2 ml/min. The column was washed with 10 
column volumes of Buffer I (20 mM Tris-HCL pH 7.5, 100 mM NaCl, 1 mM 
EDTA, 1 mM DTT, 0.01% NP-40, 5% glycerol) and eluted with 10 column 
volumes of a 100 to 280 mM NaCl gradient (Buffer I and 0 to 40% of Buffer 
G) at a flow rate of 0.6 ml/min and fraction volume of 2 ml. Fractions were 
pooled by either OD 2 so >50% peak height or >50% peak RT activity. 

[0343] The RT containing fractions from the Sepharose column were dialyzed 

against 20 volumes Buffer J (20 mM Tris-HCL pH 7.5, 100 mM NaCl, 0.1 
mM EDTA, 1 mM DTT, 0.01% NP-40, 50% glycerol) overnight at 4°C. 
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[0344] Purification Method B. One hundred grams of cells is suspended in 80 

ml of permeabilization buffer (500 mM Bis-Tris, pH 7.0 at 4°C/50 mM 
EDTA/5 mM DTT). (Note: The DTT can be added as a separate component 
and the volume of water added adjusted accordingly.) 79.6 ml of H 2 0 is 
added and the cell suspension is stirred with an overhead stirrer at -700 rpm 
with a Heidolph mixer or at 6+ with a StirPak mixer using the three blade 
mixer shaft. The suspension is checked to ensure that it is homogeneous by 
stopping the mixer and pouring the suspension into a glass beaker slowly 
while looking for clumps. Once thoroughly mixed, 80 ml 25% Triton X- 
100/10% deoxycholic acid (at ambient temperature) is added while mixing 
vigorously (as above). 400 ul of 100 mM PMSF is added and the suspension 
is allowed to stand with mixing for 30 min at 4°C. 60 ml of 2 M (NH4)2S04 is 
added and mixing is continued for an additional 15 min at 4°C. Samples can 
now be taken or proceed to the filtration stage. 
[0345] Volumes: 

Cells 100 ml 

Buffer 80 ml 

H 2 0 79.6 ml 

Detergent 80 ml 

PMSF 0.400 ml 

(NH 4 ) 2 S0 4 60 ml 

" 400 ml 

[0346] Filtration. The cell slurry is poured into a circulation vessel and a flow 

rate of 1.5 L/min/ft2 through a 0.2 um Spectrum mixed ester cellulose hollow 
fiber filter with a 1.0 mm lumen is established. Extraction is performed with 
seven volumes of 100 mM TRIS, pH 8.0/300 mM ammonium sulfate/10 mM 
EDTA/1 mM DTT/10% glycerol (v/v) while maintaining 4-6°C and a TMP of 
5-10 psi. 

[0347] Once enough filtrate has been collected, the second stage filtration 

using an AG/Tech 30,000 MWCO hollow fiber with a maximum inlet pressure 
of 50 psi. (all operations chilled to 4-6°C) is initiated. Once all seven volumes 
of extraction have been collected, the solution is concentrate to 600 ml and 
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diafiltered against five volumes of 100 mM Tris, pH 8.0/100 mM NaCl/10 
mM EDTA/1 mM DTT. 
[0348] The diafiltered retentate is collected and 50 ml previously equilibrated 

DEAE cellulose is added. The suspension is mixed gently for 15 min at 4°C. 
The resulting solution is clarified by filtration through a CUNO 30 SP depth 
filter. 

[0349] Chromatography. The above solution is applied to a previously 

equilibrated 120 ml column of Macroprep High S (20 mM Tris, pH 8.0/150 
mM NaCl/10% glycerol/1 mM EDTA/1 mM DTT/ 0.01% Triton X-100). The 
column is washed with 10 column volumes of equilibration buffer and eluted 
with a linear gradient of equilibration buffer to the same buffer at 800 mM 
NaCl. 50 fractions are collected and the major uv peak is pooled. 

[0350] The pool is applied to a previously equilibrated 120 ml column of 

Ceramic hydroxy apatite (20 mM Kpi, pH 7.0/100 mM KCl/10% 
glycerol/1 mM DTT/0.01% Triton X-100) at 20 cm/h. Elution is with this 
same buffer and 10 ml fractions are collected. The uv peak is pooled and an 
equal volume of solubilization buffer (100 mM Tris, pH7.5/300 mM NaCl/0.2 
mM EDTA/30% glycerol/1 mM DTT) is added. 

[0351] The pool is applied to a previously equilibrated 16 ml column of EM 

COO- (20 mM Tris, pH 7.5/100 mM NaCl/0.1 mM EDTA/20% glycerol/1 
mM DTT/0.01% Triton X-100) at 20 cm/h. The column is washed with two 
column volecules of equilibration buffer. Elution is with a linear gradient of 
equilibration buffer to the same buffer with 200 mM NaCl. The uv peak is 
pooled and dialyzed against 20 volumes of 20 mM Tris, pH 7.5/100 mM 
NaCl/0.1 mM/1 mM DTT/0.01% Triton X- 100/50% glycerol. 

[0352] DNA Polymerase Assays: RT DNA polymerase unit activity was 

assayed with (rA) 6 3o # p(dT)i 2 -i8. One unit of DNA polymerase activity is the 
amount of RT that incorporates one nmole of deoxynucleoside triphosphate 
into acid insoluble product at 37°C in 10 min. Specific Activity is determined 
by dividing the Units/|al by the protein concentration to get units/mg. 

[0353] cDNA synthesis from CAT cRNA was carried out in reaction mixtures 

(20 nl) containing 50 mM Tris-HCl (pH 8.4), 75 mM KC1, 3 mM MgCl 2 , 10 
mM dithiotreitol (DTT), 500 \xM each of dATP, dTTP, dGTP, and [a- 
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32 P]dCTP (300 cpm/pmole), 1,750 units/ml RNase Inhibitor, 130 ^ig/ml (465 
nM) CAT cRNA, 20 jig/ml (2,300 nM) p(dT) 25 - 3 o, and 3,250 units/ml (100 
nM) RT. Incubation was at various temperatures for 60 min individual tubes. 
An aliquot of the reaction mixture was precipitated with TCA to determine 
yield of cDNA synthesized, and the remaining cDNA product was size 
fractionated on an alkaline 1.2% agarose gel. To establish mono- and divalent 
metal reaction optima, initial reaction rates were determined under conditions 
of limiting RT concentration during 10 min incubation at 37°C or 50°C. 
Reaction mixtures (20 |al) contained 50 mM Tris-HCl (pH 8.4), 10 mM DTT, 
500 |iM each of dTTP, dATP, dCTP, and [ 3 H]dGTP (100 cpm/pmole), 10 
pmoles (2.8 fig) CAT cRNA, 50 pmoles DNA 24-mer, 0.5 pmoles RT, and 
KC1 and MgCl 2 , varied in concentration one at a time. 
[0354] Half Life Determination: Mixtures (20 |il) were incubated for various 

times in 0.5-ml tubes in a thermocycler at 50°C or 55°C and contained 50 mM 
Tris-HCl (pH 8.4), 75 mM KC1, 3 mM MgCl 2 , 10 mM DTT, 0.1% (v/v) Triton 
X-100, and 3-7 fig/ml purified RT. Incubation was stopped by placing the 
tube in ice. An aliquot (5 |il) was assayed for residual activity with 
(rA) 6 3o # p(dT)i2-i8. 

[0355] Measurement of K D by Filter Binding: A nitrocellulose filter-binding 

assay was used to determine the nucleic acid binding constants (K D ) of RTs for 
CAT cRNA*(dT) 20 . Dephosphorylated CAT cRNA was labeled at the 5' end 
with [y- 32 P]ATP and T4 polynucleotide kinase (Boehringer). 01igo(dT) 2 o was 
annealed to the poly(A)-tailed CAT cRNA in a buffer containing 10 mM Tris- 
HCl, pH 7.5, and 80 mM KC1 at 65°C for 5 min followed by chilling on ice. 
Reaction mixtures (100 \xl) containing binding buffer (50 mM Tris-HCl, pH 
8.4, 75 mM KC1, 3 mM MgCl 2 , and 10 mM DTT), 0.05 nM 32 P-labeled CAT 
cRNA, 1 nM (dT) 20 , and 1 to 50 nM RT were incubated at 23°C for 5 min. 
After incubation, the mixture was filtered through a nitrocellulose filter 
(Millipore, HA 0.45 mm) soaked in binding buffer, which was then washed 
with binding buffer. The K D is equal to that enzyme concentration at which 
50% of the labeled CAT cRNA is bound. For this method of analysis to be 
valid, the CAT cRNA concentration in the reaction must be substantially 



- 154- 



below K Dy so that the total enzyme concentration approximates the 
concentration of free unbound enzyme. 
[0356] LacZa Fidelity measurements: Fidelity measurement were conducted 

using a modified lacZa gapped procedure from Boyer, J.C. et al. .{Methods in 
Enzymology, Vol 275, p 523-537). Briefly, a modified pUC19 plasmid 
containing the promoter for T7 RNA polymerase between nucleotides -112 
and -113 (where position +1 is the first transcribed nucleotide of the lacZa 
gene) was used. The plasmid was linearized with Fspl, and transcription by 
T7 RNA polymerase produces a 344 nucleotide transcript. Using this RNA as 
template, cDNA synthesis under desired conditions (200 units SUPERSCRIPT™ 
III RT or 15 units AMV H+ RT in their respective buffers) was initiated from 
a 15-mer DNA primer. After heat denaturation, the RNA was digested and an 
excess of cDNA was used for hybridization to a circular DNA substrate 
containing a complementary single-stranded gap. This substrate was made by 
cutting M13mpl9 RF with Pvul and PvuII in the appropriate buffer and 
isolating the large linear fragment. This fragment is then denatured and 
reannealed to a 2-fold excess of single-strand M13mpl9. The resulting DNA 
product is transfected into competent DH12S cells, plated onto LB indicator 
plates containing X-gal and IPTG and plaques are scored for color. Mutation 
frequency was determined by dividing the number of mutant plaques (light 
blue or white) by the total. The enzyme's mutation frequency is corrected by 
subtracting the mutation frequency of the starting DNA. The error rate is 
determined by dividing the mutation frequency by the number of known 
detectable sites (116) and then dividing by the probability of expressing the 
minus strand (50%). 

[0357] rpsL Fidelity Assay: Fidelity assay was performed based on the 

streptomycin resistance that rpsL mutatants exhibit (Fujii, et aL, (1999) JMol 
Biol. 289(4):835-50). Briefly, pMOL 21 plasmid DNA (4 kb), containing the 
ampicillin (Ap*) and (rpsL) genes, was modified with the insertion of a T7 
RNA promoter sequence downstream of the rpsL gene. The plasmid was 
linearized with Seal and this template was used for in vitro transcription of a 
1 .4 kb RNA containing the rpsL gene. A first strand cDNA reaction using the 
desired RT and reaction conditions was carried out, followed by second strand 
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synthesis using pol I, RNase H and ligase. This double-strand DNA was then 
cut with Hindlll and EcoRI and cloned back into the original pMOL 21 
plasmid and transformed into MF101 competent cells. Cells were plated on 
ampicillin plates to determine the total number of transformed cells. Cells 
were plated on ampicillin and streptomycin plates to determine the total 
number of rpsL mutants. Mutation frequency was determined by dividing the 
total number of mutations by the total number of transformed cells. The error 
rate was determined by dividing the mutation frequency by 130 (the number of 
amino acids that cause phenotypic changes for rpsL) and the template 
doubling. In addition rpsL was further modified by altering the sequence 
prone to mutations to remove template bias. r 

[0358] TDT activity: The template-primer was prepared by annealing a 47- 

mer template (either RNA or DNA): 

[0359] 5'- 

GAGTTACAGTGTTTTTGTTCCAGTCTGTAGCAGTGTGTGAATGG 
[0360] AAG -3' (SEQ ID NO:6) to a 18-mer DNA primer (5'- 

GAAC AAAAAC ACTGTAACTC-3 ' (SEQ ID NO: 10)) [ 32 P]-labeled at the 
5 '-end with T4 polynucleotide kinase (template:primer, 3:1). Assay mixture 
(10 contained 5 nM template-primer, 200 units each of either 

Superscript™ II or Superscript™ III, all 4 dNTPs, none or dCTP, dATP, 
dTTP, dGTP (250 uM each) as shown in the figure legends, 50 mM Tris-HCl 
(pH 8.3), 75 mM KC1, 3 mM MgCl 2 , 10 mM DTT. Reactions were incubated 
at various temperatures for 10 or 60 min and terminated by the addition of 5 jlxI 
of 40 mM EDTA, 99% formamide. Reaction products were denatured by 
incubating at 95°C for 5 min and analyzed by electrophoresis on urea 6% 
polyacrylamide gels. 

[0361] Half Life Determination: Mixtures (20 \\\) were incubated for various 

times in 0.5-ml tubes in a thermocycler at 50°C, 55°C, or 60°C and contained 
IX First Strand Buffer (50 mM Tris-HCl (pH 8.3), 75 mM KC1, 3 mM 
MgCl 2 ), 10 mM DTT, 3 mM MgCl 2 (total to 6 mM), 0.5 mM dTTP, 0.05% 
(v/v) NP-40, [methyl- 3 H]dTTP (20nCi/ml), and 5 ^1 of RT. Incubation was 
stopped by placing the tube in ice. An aliquot (5 jxl) was assayed for residual 
Unit activity with 5 mM poly(A)/3 mM oligo(dT) 25 . One unit of RT DNA 



- 156- 

polymerase activity is the amount of RT that incorporates 1 nmol of 
deoxynucleoside triphosphate into acid-insoluble product at 37°C in 10 min. 
[0362] Full-length cDNA profiling: cDNA synthesis from cRNA by 

Superscript™ III was carried out in reaction mixtures (20 jxl) containing IX 
First Strand Buffer (50 mM Tris-HCl (pH 8.3), 75 mM KC1, 3 mM MgCl 2 ), 10 
mM dithiotreitol (DTT), 500 ^M each of dATP, dTTP, dGTP, and dCTP, [oc- 
32 P]-dCTP (300 cpm/pmole), 2 units/^1 RNaseOUT, different amounts of 
RNA, and 0.5 jig oligo(dT) 2 5. The mix is prewarmed at various temperatures 
for 2 minutes. After warming, 200 or 400 units of SUPERSCRIPT™ III RT are 
added. The mixtures were incubated at the same temperature for another 50 
min in individual tubes. The reactions were stopped by adding 10 \i\ of 0.5 M 
EDTA. An aliquot (5 jj!) of the reaction mixture was precipitated with TCA 
to determine yield of cDNA synthesized, and the remaining cDNA product 
was size fractionated on an alkaline 1.2% agarose gel (McDonell, et al., J. 
Mol Biol 110, 199-146, 1977). For the RTs from other vendors, each was 
assayed following the manufacturer's suggestion. 

RESULTS AND DISCUSSION 

[0363] Designation of polypeptides. Unless otherwise indicated, the 

following designations will be used to describe various polypeptides of the 
invention. SUPERSCRIPT™ II is M-MLV reverse transcriptase with a 
methionine added as a start codon (as discussed earlier , wild type M-MLV RT 
is a cleavage product that does not contain a methionine) and having three 
point mutations that reduce RNase H activity. EFN indicates a polypeptide 
having the Superscript™ II sequence with an N-terminal six histidine tag 
sequence MGGSHHHHHHGMASMTGGQQMGRDLYDDDDKH (amino 
acids 1-32 of SEQ ID NO:2 and Table 3) and containing the EFN mutations, 
which are H204R, T306K, and F309N of the Superscript™ II sequence. 
Tagged, no His EFN has the tag sequence 
MASGTGGQQMGRDLYDDDDKH (SEQ ID NO:3) and the EFN mutations, 
no tag EFN contains the EFN mutations. SUPERSCRIPT™ III, which may be 
designated LEFN, has the SUPERSCRIPT™ II sequence and an N-terminal tag 



- 157- 

sequence MASGTGGQQMGRDLYDDDDK (SEQ ID NO: 11) and the LEFN 
mutations, which are H204R, M289L, T306K, and F309N of the 
Superscript™ II sequence. His tagged LEFN indicates a polypeptide having 
an N-terminal six histidine tag sequence 

MGGSHHHHHHGMASMTGGQQMGRDLYDDDDKH (amino acids 1-32 
of SEQ ID NO:2 and Table 3) and containing the LEFN mutations. 
[0364] Purification: The protein expresses well and appears to not be toxic. 

Varying levels of arabinose induction were used with 0.2% being optimal but 
not very different from 2.0%. Induction times longer than 3 hours can be 
done, however there is a 70 kDa product that is generated over longer 
incubation periods. It appears to be a proteolysis product removing the C- 
terminus as purified N-terminal Histidine-tagged clones also had this protein 
co-purify. 

[0365] The purification protocol is essentially the same as was used for 

Superscript™ II and the Superscript™ III enzyme behaves almost the 
same throughout the purification. SUPERSCRIPT™ III may be prepared at 
concentrations of 2,000-4,000 units/jal after dialysis with no precipitation 
problems. 

[0366] Fidelity: Initial results have shown no difference between 

Superscript™ II and Superscript™ III using either the rpsL or lacZoc 
fidelity assay. RTs have sequence dependant mutation hotspots in 
homopolymeric runs of nucleotides, primarily runs of A or T. These most 
common mutations at these runs are one nucleotide insertion or deletion which 
is caused by the template or primer breathing and re-annealing with an 
overlap. The resulting frameshift mutation usually results in premature 
termination of the gene product. The modified EFN polypeptide, which 
contains the H204R, T306K, and F309N mutations, had improved frameshift 
error rates compared to SUPERSCRIPT™ II and MMLV H+. Experiments are 
ongoing to determine if Superscript™ III, which contains the M289L 
mutation in addition to the H204R, T306K, and F309N mutations, also has 
this improvement. 

[0367] Specific Activity: The specific activity for SUPERSCRIPT™ III is about 

25% higher than SUPERSCRIPT™ II. RT DNA polymerase unit activity was 
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assayed with (rA) 6 3o*p(dT)i 2 -i8 at 37°C and protein concentration was 
determined by Bradford assay. Table 14 shows the RNA-directed DNA 
polymerase specific activity of SUPERSCRIPT™ II and SUPERSCRIPT™ III 
using (rA) 6 30*p(dT)i2-i8 at 37°C for 10 min. 



Table 14. 



Enzyme 


DNA Polymerase Specific Activity 
(units/mg) 


Superscript™ II 


330,000 


Superscript™ III 


410,000 



[0368] Mono- and Divalent Metal Reaction Optima: Using primed CAT RNA 

as substrate, the Mg 2+ and KC1 concentration optima were determined and are 
shown in Figures 10A and 10B. The DNA polymerase assay for 
Superscript™ III RT was conducted at 37°C or 50°C for 10 minutes under 
various concentrations of Mg 2+ (Fig. 10A) or KC1 (Fig. 10B). SUPERSCRIPT™ 
II at 37°C is included for comparison. The Mg 2+ concentration optima is at 3 
mM at both 37°C and 50°C. Though at 50°C there appears to be a broader 
working range of 2-5 mM without much difference. For KC1, the optima is at 
75 mM at both temperatures, but the working range is from 25 - 125 mM 
without much difference. Again at the higher temperature a broader range is 
allowed (25-200 mM). Since there is no change in optima between 
Superscript™ II and III, the 5X First Strand Buffer from Superscript™ II 
can be used as Superscript™ Ill's reaction buffer. 

[0369] TdT activity: TdT or non-template directed nucleotide addition to the 

3' primer end is a well known property of many polymerases including RTs 
(Chen,D. et aL, Biotechniques 2001; 30(3):574-582). Using a DNA template 
(Fig. 1 1 A), Superscript™ II adds 1-3 additional nucleotides to the end of the 
transcript. SUPERSCRIPT™ III however has a greatly reduced TdT and on a 
DNA template will not add any detectable nucleotides. On an RNA template 
(Fig. 1 IB), Superscript™ II adds 1-2 bases with 69% of the primer extended 
after 10 minutes and 90% extended after an hour. Superscript™ III does 
have some TdT activity on RNA template, but it is reduced with about 14% 
extended after 10 minutes and 50% extended after an hour at 45°C as shown in 
Figure 11B. This TdT activity is biased with the addition of dATP being 
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strongly favored followed by dGTP, dCTP and dTTP for both Superscript™ 
II and III. TdT activity is temperature and time dependant. The optimal 
temperature for Superscript™ III is 45 °C to 50°C while Superscript™ II is 
at 45°C (Figure 11). To further minimize TdT activity in any RT one must 
shorten the extension time or lower the temperature. 
[0370] Fidelity assays. The results of the fidelity assays are shown in Tables 

15 and 16. 



Table 15. rpsL Fidelity Assay 



RT 


Temp 


Wt rpsL 


Mutated rpsL 


Superscript™ III 


45 U C 


3.9 X10" 4 


3.4 X10" 5 


Superscript™ II 


45°C 


3.5 xicr 4 


3.1 X10 -5 


MMLV H+ 


37°C 


3:9 X1Q- 4 


NA 



Table 16. lacZa Fidelity Assay 



RT 


Temp 


Error Rate 


Superscript™ III 


45°C 


2.9 XI O" 5 


50°C 


2.7 X10" 5 


55°C 


1.8 X10" 5 


Superscript™ II 


45°C 


2.9 X10" 5 


MMLV H+ 


45 U C 


4.8 X10" 5 



[0371] Fidelity assays for SUPERSCRIPT™ III, SUPERSCRIPT™ II and MMLV 

Rnase H+ RTs. rpsL assay was conducted with either WT rpsL gene or rpsL 
gene with mutation hotspots removed. A wt rpsL DNA fragment was used as 
control to see the spontaneously mutation. The error rate is about 10~ 6 to 10" 7 . 
The lacZa assay was conducted with gapped DNA starting substrate used as a 
control with a background error rate of 2.2 x 10" 5 . 

[0372] Thermal Stability of Purified Mutant RTs: The rate of chemical RNA 

breakdown catalyzed by Mg 2+ increases dramatically as the temperature is 
increased above 55°C to 60°C under cDNA synthesis reaction conditions. In 
order to characterize their intrinsic thermal stability, the half lives of M-MLV, 
Superscript™ II and Superscript™ III were measured at elevated 
temperatures. The RT enzymes were first diluted to 0.2-0.5 Unit/jal in the IX 
first strand buffer in the presence of 0.05% of NP-40. If diluted in absence of 
detergent, the RT activity is reduced to more than 90% without incubation 
(data not shown). Figures 12A, 12B, and 12C and Table 17 show the half- 
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lives of M-MLV, SUPERSCRIPT™ II, and SUPERSCRIPT™ III at various 
temperatures. The RT half-lives were detected in IX First strand reaction 
buffer in presence of 0.05% NP-40. Mixtures (20 were incubated for 
various times in 0.5-ml tubes in a thermocycler at 50°C, 55°C, or 60°C and 
contained IX First Strand Buffer (50 mM Tris-HCl (pH 8.3), 75 mM KC1, 3 
mM MgCl 2 ), 10 mM DTT, 3 mM MgCl 2 (total to 6 mM), 0.5 mM dTTP, 
0.05% (v/v) NP-40, [methyl- 3 H]dTTP (20^Ci/ml), and 5 |xl of RT. Incubation 
was stopped by placing the tube in ice. An aliquot (5 pi) was assayed for 
remaining units with 5 mM poly(A)/3 mM oligo(dT)25. The half live of 
Superscript™ III at 50°C was 220 minutes, about 35 fold longer than that of 
- Superscript™ II. The half live of Superscript™ III at 55°C was 24 min, 
-10 fold longer than that of Superscript™ II. 



Table 17. Summary of RT Half lives at 50°C, 55°C, and 60°C 





M-MLV 


Superscript™ II 


Superscript™ III 




(min) 


(min) 


(min) 


50°C 


ND 


6.1 


220 


55°C 


1.5 


2.2 


24 


60°C 


0.6 


ND 


2.5 



[0373] Full-length cDNA profiling of RT: A practical method forjudging the 

useful higher temperature of the DNA polymerase activity of RT is an 
assessment of the effect of increasing reaction temperature on the amounts of 
full-length cDNA products synthesized by RT from mixture of cRNA of 
various lengths. The labeled full-length cDNA products can be separated and 
quantified on an alkaline agarose gel. As the reaction temperate is increased, 
full-length cDNA products disappear starting with those derived from longer 
cRNA until a temperature is reached where no discernible full-length product 
of any length is synthesized. For comparison, we included Clontech's 
PowerScript RT, Stratagene's StrataScript RT, Promega's ImProm II RT, and 
Superscript™ II (Invitrogen Corporation, Carlsbad, CA, catalog # 
18064022). Only SUPERSCRIPT™ III could produce a clear 9.5 kb full-length 
product at 50°C, and SUPERSCRIPT™ II can get a very faint 9.5 kb band. 

[0374] To compare RT from different companies, the same number of units of 

RT were used for comparison. PowerScript RT is -350 unit/jxl, StrataScript 
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RT is 50 unit/jxl, ImProm II is 50 unit/ul. SUPERSCRIPT™ II, SUPERSCRIPT™ 
III and PowerScript RT were diluted to 50 unit/ul in RT dilution buffer. 100 
units (2 nl) of RT enzyme were used. The amounts of full-length products 
generated were determined and are shown in Tables 17 and 18 for various 
length templates. SUPERSCRIPT™ III RT shows improved performance at 
50°C and 55 °C compared to the other RTs. 



Table 17. Comparison of full length cDNA produced by various RTs. 



Temp. 


RT 


Total cDNA 


Amount of Full-Length cDNA 










Pmol 


7.5 kb (ng) 


2.4 kb (ng) 




PowerScript 


293 


75 


64 i 




StrataScript 


199 


47 


35 


45°C 


ImProm II 


256 


41 


44 


Superscript™ III 


345 


99 


72 




Superscript™ II 


300 


100 


59 




PowerScript 


180 


54 


31 




StrataScript 


9 


<2 


<2 


50°C 


ImProm II 


42 


<2 


<2 


Superscript™ III 


323 


97 


68 | 




Superscript™ II 


170 


46 


22 




PowerScript 


5 


<2 


<2 




StrataScript 


5 


<2 


<2 


55°C 


ImProm II 


3 


<2 


<2 




Superscript™ III 


37 


3 


22 




Superscript™ II 


3 


<2 


<2 



[0375] The amounts of full-length product were established by cutting the 

region in a dried 1.2% alkaline agarose gel corresponding to the size of each 
full-length band and counting it in a scintillation counter. Only amounts of 
full-length products > 2 ng could be seen as discernible bands on the gel 
autoradiograph. The cDNA synthesis reactions were performed in 20 jal with 
0.25 jag of 2.4 kb, 0.5 mg of 7.5 kb cRNA, and 0.1 ^ig of oligo (dT) 25 . If 
provided, buffers and other components were used from their own kits plus 0.1 
Hi of oc- 32 P-dCTP (10 nCi/|il). All the reactions were "hot start." In "hot 
start" reactions, tubes were incubated at reaction temperature for 2 minutes 
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before adding enzyme. 2 pi of RT (50 u/|al) was added and incubated at 45°C 
or 50°C or 55°C for 50 min. The reactions were stopped by adding 10 jal of 
0.5 M EDTA. After ethanol precipitation, the products were loaded on 1.2% 
agarose gel. The dried gel was exposed on the film for 1 hour. 



Table 18. Comparison of full length cDNA produced by various RTs. 





Temp (°C) 


Amount of full length cDNA (ng) 






9.5 kb 


7.5 kb 


S uperS cript™ 


/I c 

45 


19 


54 


TTT 

111 


50 


26 


78 




55 


13 


53 


StratScnpt 


45 


4 


10 




50 


0 


0 




55 


0 


0 


ImProm II 


45 


3 


9 




50 


0 


0 




55 


0 


0 


OmniScript 


45 


0 


0 




50 


0 


0 




55 


0 


0 


PowerScript 


45 


19 


38 




50 


3 


25 




55 


0 


0 


M-MLV 


45 


10 


39 




50 


0 


3 




55 


0 


0 


ThermoScript 


45 


16 


38 




50 


29 


56 




55 


3 


13 



[0376] cDNA synthesis reaction mixtures contained 9.5 kb (1.1 jag) and 7.5 kb 

(0.8 jag) cRNA. The cDNA synthesis reactions were performed in 20 jil with 
20 |^g of Hela total RNA, and 0.5 jag of oligo (dT) 2 s. If provided, buffers and 
other components used were from their own kits plus 0.1 jlxI of a- 32 P-dCTP 
(10 \iC\I \x\). All the reactions were "hot start," 1 \xl of RT was added and 
incubated at 45°C or 55°C for 50 min. 400 or 200 units of SUPERSCRIPT™ III 
were used for comparison. The reactions were stopped by adding 10 juil of 0.5 
M EDTA. After ethanol precipitation, the products were loaded on 1.2% 
agarose gel. The dried gel was exposed on the film for 1 hour. 
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In an alternate procedure for comparing RTs from various sources, a 
comparison of cDNA synthesis from Hela total RNA was performed. The 
cDNA size synthesized by Superscript™ III RT at 55°C is from 0.5-10 kb. 
cDNA synthesis reactions were performed in 20 fxl with 20 jag of Hela total 
RNA, and 0.5 (xg of oligo(dT) 2 5. If provided, buffers and other components 
used were from their own kits plus 0.1 jil of a- 32 P-dCTP (10 pCi/ jal). All the 
reactions were "hot start," 1 jxl of RT was added and incubated at 45 °C or 
55°C for 50 min. 400 or 200 units of Superscript™ III were used for 
comparison. The reactions were stopped by adding 10 jllI of 0.5 M EDTA. 
Total cDNA synthesis was obtained by TCA-precipitation of 5 (^1 of mixtures. 
After ethanol precipitation of the rest of the mixture, the products were loaded 
on 1.2% agarose gel. The dried gel was exposed on the film for 1 hour. 

Each reaction was performed according to each vendor's 
recommendation. In these experiments, we used the buffers provided with the 
RTs. DTT, RNase inhibitor, dNTP were used at the recommended 
concentration and from the kit if provided. Otherwise, Invitrogen's reagents 
were used. 1 jal of RT was used in these competitive audit. Table 18 shows 
the result when using 9.5 kb and 7.5 kb cRNA. Fig. 13 shows the results of a 
competitive audit for cDNA profiling using poly (A)-tailed RNA ladder. The 
cDNA synthesis reactions were performed in 20 \xl with 2 jag of RNA Ladder 
and 0.5 \ig of oligo(dT)25« If provided, buffers and other components were 
used from their own kits plus 0.1 \il of <x- 32 P-dCTP (10 jiCi/ jal). All the 
reactions were "hot start," 1 |il of RT was added and incubated at 45°C or 
50°C or 55°C for 50 min. 400 or 200 units of SUPERSCRIPT™ III were used for 
comparison. The reactions were stopped by adding 10 jal of 0.5 M EDTA. 
After ethanol precipitation, the products were loaded in different order on two 
1 .2% agarose gels. The dried gels were exposed on the film for 1 hour. 

EXAMPLE 8 
RT-PCR applications. 
RT-PCR has become an effective tool in such applications as detecting 
RNA, gene expression, profiling, gene quantitation, and cloning full-length 
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genes. An RNase H minus mutant of MMLV RT, SUPERSCRIPT™ II RT, is 
widely accepted as the RT choice since it has already proven that it produces 
higher cDNA yield and longer target capacity than MMLV RT. 

[0380] Recently we have engineered a new generation RT - Superscript™ 

III™ RT. In addition to all the premier features of SUPERSCRIPT™ II RT, it 
also provides a high temperature RT capacity up to 55 degrees, which is 
approximately 5 degrees or more higher than SUPERSCRIPT™ II RT. The 
enzyme half-life of SUPERSCRIPT™ II RT and SUPERSCRIPT™ III RT at 50°C 
is 6 minutes and 220 minutes respectively. Efficient full-length cDNA 
synthesis activity occurs with Superscript™ III even at 55 degrees RT 
temperature. High yield and more specific RT-PCR products were also 
produced by this enzyme at elevated RT temperatures. Superscript™ III 
provides a tool for efficient cDNA synthesis for difficult RNA targets such as 
high GC and secondary structured templates. 

[0381] The enzyme has also been optimized for RT-PCR for both sensitivity 

and long targets. We have demonstrated that Superscript™ III RT is able to 
amplify as little as 1 pg of starting HeLa RNA and amplify targets up to 12.3 
kb in length. This enzyme, when used with the accompanying buffers and 
conditions, provides performance better than that of other MMLV derivative 
reverse transcriptases. 

MATERIALS AND METHODS 

[0382] Kit Components: SUPERSCRIPT™ III RT (200 U/^l), 5X First-Strand 

Buffer, 0.1MDTT. 

[0383] SUPERSCRIPT™ III RT = H204R, M289L, T306K, and F309N 

mutations and may be referred to as the LEFN RT. 
[0384] 5X First-Strand Buffer: 250 mM Tris-HCl (pH 8.3), 375 mM 

potassium chloride, and 15 mM magnesium chloride. 
[0385] Superscript™ III RT Storage Buffer: 20 mM Tris-HCl, pH 7.5, 100 

mM NaCl, 1 mM EDTA, 1 mM DTT, 0.01% NP-40, 50% (v/v) glycerol. 
[0386] RNA: Total HeLa RNA and total rat brain RNA were isolated using 

TRIZOL® Reagent. 
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[03871 Gel Electrophoresis: RT-PCR products (10 jxl) were analyzed by 

electrophoresis on 0.8% to 1.5% (w/v) agarose gels in 0.5X TBE with 0.4 
jag/ml ethidium bromide. 

[0388] RT-PCR Procedure: First-Strand cDNA Synthesis for RT-PCR: A 20- 

\A reaction volume can be used for 10 pg - 5 |ag of total RNA or 10 pg - 500 
ng of mRNA. Add the following components to a nuclease-free microfiige 
tube: 1 jul Oligo(dT) 20 (500 ^ig/ml), 10 pg to 5 |ag total RNA, 1 fxl 10 mM 
dNTP mix (10 mM each dATP, dTTP, dGTP, and dCTP at neutral pH), and 
sterile, distilled water to 12 |il. Alternatively, 50-250 ng random primers or 2 
pmole of a gene specific primer may be used. Use of random primers requires 
incubation at 25 °C for 10 min before the 50°C incubation. 

[0389] Heat the mixture at 65 °C for 5 min and then place on ice. Collect the 

contents of the tube by brief centrifugation and add: 4 jal 5X First-Strand 
Buffer, 2 jal 0.1 M DTT, 1 jal RNaseOUT Recombinant Ribonuclease Inhibitor 
(40 units/^il, when using less than 50 ng of starting RNA, the addition of an 
RNase inhibitor {e.g., RNaseOUT, Invitrogen Corporation, Carlsbad, CA, 
catalog # 10777019) is essential). 

[0390] Mix the contents of the tube gently and incubate at 50°C for 2 minutes. 

Add 1 \x\ (200-400 units) of SUPERSCRIPT™ III RT, mix by pipetting gently up 
and down. Incubate 50 min at 50°C (cDNA synthesis can be performed at 
42°C-60°C for oligo(dT) 2 o or gene specific primers). Inactivate the reaction 
by heating at 70°C for 1 5 min. The cDNA can now be used as a template for 
amplification in PCR or can be stored at -20°C until use. However, 
amplification of some PCR targets (those >1 kb) may require removal of RNA 
complementary to the cDNA. To remove RNA complementary to the cDNA, 
add 1 1^1 (2 units) of E. coli RNase H and incubate at 37°C for 20 min. 

RT-PCR Optimization 

[0391] For the following experiments, either EFN (His-tag), EFN (no tag), 

LEFN (His-tag), or LEFN (tag, no His) were used in finding the optimal 
conditions for LEFN (tag, no His). These 4 reverse transcriptases differ in 
their thermal-stability profiles. The modified version of SUPERSCRIPT™ III, 
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EFN, contains the H204R, T306K, and F309N mutations. LEFN, otherwise 
known as SUPERSCRIPT™ III RT, contains the M289L mutation in addition to 
the H204R, T306K, and F309N mutations. 

[0392] Unless otherwise noted, 20 ul RT reactions were typically done using 

2.5 uM 01igo(dT) 2 o (500 ng 01igo(dT)i 2 -i 8 for SUPERSCRIPT™ II RT), 0.5 mM 
dNTPs, IX First-Strand Buffer, 10 mM DTT, and 40 units of RNaseOUT. In 
"hot start" reactions, tubes were incubated at reaction temperature for 2 
minutes before adding enzyme. Reactions were treated with 2 units of RNase 
H at 37°C for 20 min after cDNA synthesis. 

[0393] For the evaluation of these. RTs in RT-PCR, standard 50 ul PCR 

reactions were performed. For primer sets deigned to amplify < 4 kb, PCR 
reactions consisted of 0.2 uM primers, 200 uM each dNTP, IX PCR Buffer, 
1.5 mM MgCl 2 , 2 ul of the cDNA reaction, and 2 units of PLATINUM® Taq 
DNA Polymerase. For primer sets designed to amplify > 4 kb, PCR reactions 
consisted of 0.2 uM primers, 200 uM each dNTP, IX High Fidelity PCR 
Buffer, 2 mM MgS0 4 , 2 ul of the cDNA reaction, and 1 unit of PLATINUM® 
Taq DNA Polymerase High Fidelity. After an incubation of 94°C for 2 min, 
amplification was 35 to 40 cycles of 94°C for 15 s, 55°C - 60°C for 30 s, and 
68°C for 1 min/kb. 10 ul of the PCR reactions were analyzed on agarose gels 
containing 0.4 ug/ml EtBr. Primers used in PCR are found in Table 19. 

[0394] cDNA synthesis buffer: Reactions with 50 or 200 units of 

Superscript™ II RT or EFN (His-tag) were assembled using the reagent 
systems of Superscript™ II RT (stand-alone), Superscript™ II First- 
Strand Synthesis System for RT-PCR, and ThermoScript RT-PCR System. 
The conditions for each are as follows: 

1) Superscript™ II RT stand-alone: 50 mM Tris-HCl 
(pH 8.3), 75 mM KC1, 3 mM MgCl 2 , 0.5 mM dNTPs 

2) Superscript™ II First-Strand Synthesis System for 
RT-PCR: 20 mM Tris-HCl (pH 8.4), 50 mM KC1, 5 
mM MgCl 2 , 0.5 mM dNTPs 

3) ThermoScript RT-PCR System: 50 mM Tris-acetate 
(pH 8.4), 75 mM potassium acetate, 8 mM magnesium 
acetate, 1 mM dNTPs 
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20 |il RT reactions containing 1 pg to 100 ng of starting total HeLa RNA were 
performed at 45°C for 50 min followed by 85°C for 5 min. 2 jil of the 
resulting cDNA were added to 50 |il PCR reactions containing the p-actin 353 
bp, BF 2.4 kb, Pol s 6.8 kb, or APC 8.5 kb primer set (see Table 19). 
[0395] Magnesium chloride was titrated into cDNA synthesis buffers in final 

reaction concentrations of 1 to 10 mM, with dNTP concentrations of 0.5 mM 
and 1 mM tested concurrently. These buffers were used in 20 jjlI RT reactions 
performed at 45°C for 50 min followed by 85°C for 5 min with 1 pg, 100 ng, 
or 5 ng of total HeLa RNA and 50 units or 200 units of EFN (His-tag). 2 |Ltl 
of the resulting cDNA were added -to 50 |il PCR reactions containing the p- 
actin 353 bp or Pol s 6.8 kb target (the 5 [ig cDNA samples were diluted 100 
fold before addition). 

[0396] 5X First-Strand Buffer containing Tris-HCl at pH 8.0, 8.4, 8.8 and 8.3 

were prepared to determine the ideal pH for the enzyme to function at higher 
reaction temperatures. 20 RT reactions containing 100 ng to 500 ng of total 
HeLa RNA and 200 units of SUPERSCRIPT™ II or 400 units of LEFN (His-tag) 
were performed at 50°C - 60°C for 50 min (hot start) followed by 70°C for 15 
min using these different buffers. 2 nl of the resulting cDNA were added to 
50 ^1 PCR reactions containing the BF 2.4 kb or Pol s 6.8 kb primer set. 

[0397] RT reaction temperature: SUPERSCRIPT™ II RT (50 units), EFN (His- 

tag) (50 and 200 units), and ThermoScript RT (15 units) were compared in RT 
reactions from 42°C to 60°C. 20 ^1 RT reactions containing 1 pg to 100 ng of 
starting total HeLa RNA were performed at the given temperatures for 50 min 
(hot start) followed by 85°C for 5 min. 2 |al of the resulting cDNA were added 
to 50 PCR reactions containing the p-actin 353 bp, TSC 5.3 kb, or Pol s 6.8 
kb primer set. 

[0398] Superscript™ II RT and EFN (no tag) were compared at 200 units in 

RT reactions at 45°C and 50°C. 20 |al RT reactions containing 1 ng to 1 ng of 
total HeLa RNA were performed at the given temperatures for 50 min (hot 
start) followed by 70°C for 15 min. 2 ^1 of the resulting cDNA were added to 
50 p.1 PCR reactions containing primer sets from 2.4 kb to 9.3 kb. 

[0399] Superscript™ II RT, EFN (His-tag), and EFN (no tag) were 

compared at 200 units in RT reactions from 45°C to 55°C. 20 |il RT reactions 
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containing 1 pg to 100 ng of total HeLa RNA were performed at the given 
temperatures for 50 min (hot start) followed by 70°C for 15 min. 2 |il of the 
resulting cDNA were added to 50 \il PCR reactions containing either the p- 
actin 353 bp or Pol 8 6.8 kb primer set. 

[0400] Superscript™ II RT, EFN (His-tag), EFN (no tag), and LEFN (His- 

tag) were compared at 200 units in RT reactions from 45°C to 55°C. 20 \il RT 
reactions containing 1 pg to 1 |ag of total HeLa RNA were performed at the 
given temperatures for 50 min (hot start) followed by 70°C for 1 5 min. 2 ^1 of 
the resulting cDNA were added to 50 [il PCR reactions containing either the 
p-actin 353 bp or Pol e 6.8 kb primer set. 

[0401] Superscript™ II RT, EFN (His-tag), EFN (no tag), and LEFN (His- 

tag) were compared at 200 units in RT reactions from 45 °C to 55°C with a 
gene-specific primer (CBP 1.6 kb) instead of Oligo(dT). 20 |il RT reactions 
containing 1 ng or 10 ng of total HeLa RNA were performed at the given 
temperatures for 50 min (hot start) followed by 70°C for 15 min. 2 ^il of the 
resulting cDNA were added to 50 |xl PCR reactions containing the CBP 1.6 kb 
primer set. 

[0402] Superscript™ II RT, EFN (no tag), EFN (His-tag), and LEFN (His- 

tag) were compared at 200 and 400 units in RT reactions from 45°C to 60°C. 
20 |il RT reactions containing 500 ng of total HeLa RNA were performed at 
the given temperatures for 50 min (hot start) followed by 70°C for 15 min. 2 
|il of the resulting cDNA were added to 50 (il PCR reactions containing either 
the BF 2.4 kb or Pol e 6.8 kb primer set. 

[0403] Superscript™ II RT, EFN (His-tag), LEFN (His-tag), and LEFN 

(tag, no His) were compared at 200 and 400 units in RT reactions from 45°C 
to 60°C. 20 |il RT reactions containing 500 ng of total HeLa RNA were 
performed at the given temperatures for 50 min (hot start) followed by 70°C 
for 15 min. 2 (xl of the resulting cDNA were added to 50 |il PCR reactions 
containing either the BF 2.4 kb or Pol s 6.8 kb primer set. 

[0404] RT enzyme concentration: 20 |ul RT reactions containing 0. 1 pg to 1 

(ig of starting total HeLa RNA and 25 units to 250 units of EFN (His-tag) 
were performed at 45°C for 50 min followed by 85°C for 5 min. 2 |il of the 
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resulting cDNA were added to 50 ^il PCR reactions containing either the 0- 
actin 353 bp, CBP 1.6 kb, TSC 5.3 kb, Pol e 6.8 kb, or APC 8.5 kb primer set. 

[0405] Superscript™ II RT and LEFN (His-tag) were compared at 200 units 

and 400 units in RT reactions from 50°C to 60°C. 20 |il RT reactions 
containing 10 ng to 1 |ig of total HeLa RNA were performed at the given 
temperatures for 50 min (hot start) followed by 70°C for 15 min. 2 |al of the 
resulting cDNA were added to 50 |il PCR reactions containing either BF 2.4 
kb or Pol 8 6.8 kb primer set. 

[0406] Superscript™ II RT and LEFN (His-tag) were compared at 200 units 

and 400 units (LEFN was also used at 800 units) in RT reactions (with or 
without 0.05% Triton X-100) from 50°C to 60°C. 2O 41I RT reactions 
containing 500 ng of total HeLa RNA were performed at the given 
temperatures for 50 min (hot start) followed by 70°C for 15 min. 2 jal of the 
resulting cDNA were added to 50 (il PCR reactions containing either BF 2.4 
kb or Pol e 6.8 kb primer set. Triton did not improve the yield of the reaction 
and actually reduced the yield slightly. 

[0407] LEFN (tag, no His) was compared at 50, 200, and 400 units to 

determine the effect on sensitivity. 20 \xl RT reactions containing 0.1 pg to 
100 pg of total HeLa RNA were performed at 50°C under both hot start and 
cold start conditions followed by 70°C for 15 min. 2 [il of the resulting cDNA 
were added to 50 |il PCR reactions containing either the p-actin 353 bp or 
GAPDH 532 bp primer set. 

[0408] Comparison of 85°C, 5 min and 70°C, 15 min Inactivation Steps. 

Superscript™ II RT and EFN (no tag) were compared in RT reactions with 
either a 70°C, 15 min or an 85°, 5 min inactivation step. 20 ^1 RT reactions 
containing 1 pg to 100 ng of starting total HeLa RNA and 200 units of RT 
were performed at 45°C for 50 min followed by one of the inactivation steps. 
2 \xl of the resulting cDNA were added to 50 \il PCR reactions containing 
either the p-actin 353 bp, CBP 1.6 kb, Pol s 3.5 kb, TSC 5.3 kb, or Pol 8 6.8 
kb primer set. 

[0409] Priming with Oligo(dT)20, Random Hexamers, and Gene-Specific 

Primers (GSP). EFN (no tag) and SUPERSCRIPT™ II RT were used in RT 
reactions containing either 2.5 jaM oligo(dT) 2 o, 50 ng random hexamers, or 0.1 
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uM GSP. 20 ul RT reactions containing 1 ng to 1 ug of starting total HeLa 
RNA and 200 units of RT were performed at 45°C for 50 min followed by 
70°C for 15 min. 2 ul of the resulting cDNA were added to 50 ul PCR 
reactions containing either the CBP 1.6 kb, TSC 5.3 kb, or APC 8.5 kb primer 
set (Fig. 21). 

Comparison of reverse transcriptases from various sources. LEFN 
(tag, no His) was compared to Clontech PowerScript™ RT, Stratagene 
StrataScript™ RT, Qiagen SensiScript™ RT, Qiagen OmniScript™ RT, and 
Promega ImProm-II™ RT System. RT reactions containing 0.1 pg to 1 ug of 
starting total HeLa RNA or 1- ug of rat brain RNA were performed in 
duplicate with each of the reverse transcriptases using the procedure and 
conditions recommended by each supplier. 10% of the cDNA reactions were 
added to 50 ul PCR reactions containing either the P-actin 353 bp, GAPDH 
532 bp, CBP 1.6 kb, BF 2.4 kb, VIN 4.6 kb, FIB 9.4 kb, or Dynein 12.3 kb 
primer set. 
Table 19: Primer list 

Human P-actin - 353 bp 

sense GCTCGTCGTCGACAACGGCTC (SEQ ID NO: 1 2) 

antisense CAAACATGATCTGGGTCATCTTCTC (SEQ ID NO: 1 3) 

Human GAPDH - 532 bp 

sense GTGAAGGTCGGAGTCAACGGATTT (SEQ ID NO: 14) 

antisense CACAGTCTTCTGGGTGGCAGTGAT (SEQ ID NO: 1 5) 

Human Cap Binding Protein (CBP) - 1.6 kb 
sense ATGGCGATCGTCGAACCGGA (SEQ ID NO: 1 6) 

antisense CACTGTCTTAATATGAATGGGACCTACTGAG (SEQ ID 
NO: 17) 

Human B-factor properdin (BF) - 2.4 kb 

sense GAGCCAAGCAGACAAGCAAAGCAAGC (SEQ ID NO: 1 8) 

antisense TGTTTTAATTCAATCCCACGCCCCTGT (SEQ ID NO:19) 

Human DNA Polymerase e (Pol e) - 3.5 kb 

sense AAGGCTGGCGGATTACTGCC (SEQ ID NO:20) 

antisense GATGCTGCTGGTGATGTACTC (SEQ ID NO:21 ) 



Human Vinculin (VIN) - 4.6 kb 

sense GAGGAGGGCGAGGTGGACGGC (SEQ ID NO:22) 
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antisense GAACTAACACACAGCGATGGGTGGGAA (SEQ ID 
NO:23) 

Human Tuberous Sclerosis 2 (TSC-2) - 5.3 kb 

sense GGAGTTTATCATCACCGCGGAAATACTGAGAG (SEQ 

ID NO:24) 

antisense TATTTCACTGACAGGCAATACCGTCCAAGG (SEQ ID 
NO:25) 

Human DNA Polymerase s (Pol e) - 6.8 kb 

sense CGCCAAATTTCTCCCCTGAA (SEQ ID NO:26) 

antisense CCGTAGTGCTGGGCAATGTTC (SEQ ID NO:27) 

Human Adenomatous Polyposis coli (APC) - 8.5 kb 

sense GCTGCAGCTTCATATGATCAGTTGTTA (SEQ ID NO:28) 

antisense AATGGCGCTTAGGACTTTGG (SEQ ID NO:29) 

Human Fibrillin (FIB) - 9.4 kb 

sense TGGAGGCTGGGAACGTGAAGGAAA (SEQ ID NO:30) 

antisense ACAGGAATGACCGAGGGTAATCTTGGC (SEQ ID 
NO:31) 

Rat Dynein - 12.3 kb 

sense GCGGCGCTGGAGGAGAA (SEQ ID NO:32) 

antisense AGGTGGCGGCTCAAACACAAAG (SEQ ID NO:33) 

H-fibroblast growth factor 11 (FGF)-240 bp; 97 °C denaturing, 60 °C 
annealing temp. 

sense (fl-60)- CGGGTGGTAACTGGCTGCTGTGGA (SEQ ID NO:34) 
antisense TGGAGGCTGGGAACGTGAAGGAAA (SEQ ID NO:35) 
antisense (r2-299)- GCGGACCTCCCGCTTCTGCCGGA (SEQ ID NO:36) 

H-cystathionine-beta-synthase (CBS 2.4)-2390 bp; 64 °C annealing 
temperature 

sense (f2-71)- CCAAGTAAAACAGCATCGGAACACCAGG (SEQ ID 
NO:37) 

antisense (r2-2460)- AAAGTCGATCAGCAGTTGCCAGGGG (SEQ ID 
NO:38) 

H-topoisomerase I (TOP3.2)-3162 bp; 60 °C annealing temperature 
sense (f2-80)- CCCACAGTCACCGCCGCTTACCT (SEQ ID NO:39) 
antisense (rl-3241)- CTTCATCCCTCCCCAACCCCAATCT (SEQ ID 
NO:40) 

H-vinculin (VIN4.6)- 4641 bp; 66 °C annealing temperature 

sense (fl-132)- GAGGAGGGCGAGGTGGACGGC (SEQ ID NO:41) 

antisense (r2-4772)- GAACTAACACACAGCGATGGGTGGGAA (SEQ ID 

NO:42) 
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H-PolE (PolE 6.8)-6800 bp; 60 °C annealing temperature 
sense - CGCCAAATTTCTCCCCTGAA (SEQ ID NO:43) 
antisense - CCGTAGTGCTGGGCAATGTTC (SEQ ID NO:44) 

H-p-actin (0act353)- 353 bp; 55 °C annealing temperature 
sense - 5' GCTCGTCGTCGACAACGGCTC (SEQ ID NO:45) 
antisense - ACCACATGATCTGGGTCATCTTCTC (SEQ ID NO:46) 



Comparison of Superscript™ III to Superscript II 

[0411] A comparison of Superscript™ III to SUPERSCRIPT™ II was made 

using the following assay. Total HeLa RNA (1 \xg) was combined with 10 
pmol gene specific primer (antisense primer, Table 19) and hybridized at 65 C 
for 5 min. Transcription with 200 U of each enzyme was carried out at 42 °C, 
50 °C, or 55 °C for 50 min. Reverse transcription for comparison of enzyme 
preparations were performed using 1 pg, 10 pg, 100 ng or 1 jag of total RNA 
from HeLa cells with 50 pmol OligodT( 2 0) at 55°C 50 min. Reaction 
components included 0.5 mM dNTPs, 0.01 M DTT, 40 U RNase OUT™, 50 
mM Tris-KCl (pH 8.3), 75 mM KC1, 3 mM MgCl 2 .After reverse transcription, 
the enzyme was heat inactivated by incubation at 70°C for 15 min. After a 20 
min. incubation with 2 U RnaseH at 37°C, 2 jxL of the 20 jiL reaction was 
amplified by PCR using Platinum Taq hi-fidelity. The amplification reaction 
contained 60 mM Tris-S0 4 (pH 8.9), 18 mM NH 4 S0 4 , 2 mM MgS0 4 , 0.2 mM 
each dNTP, 0.2 |aM each primer (Table 19), and 1 U Platinum Taq high 
fidelity. The amplification was carried out by an initial 94 °C denaturation 
step for two min., followed by 35 cycles of the following steps: 94°C 15 sec, 
X°C 30 sec, 68°C 1 min/kb, where X indicates the annealing temperature of 
the primer set listed in Table 19 (55°C - 66°C). High G/C content products 
were amplified with the following cycling beginning with a 94°C denaturation 
step for two min. followed by 35 cycles of the following: 97°C, 15 sec; 60°C 
30 sec; 68°C one min. The reactions were combined with 5 [xL 10X 
Blue Juice® and 20% (11 jiL) of each reaction was electrophorsed through 
0.8% or 1.5% agarose gels stained with 0.5 |ig/ml ethidium bromide. 
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[0412] RT-PCR Optimization: cDNA synthesis buffer. SUPERSCRIPT™ III 

RT was used with the SUPERSCRIPT™ II RT stand-alone conditions. These 
conditions were found to be optimal for Superscript™ III RT in first-strand 
synthesis. No significant difference in performance in RT-PCR was seen with 
Superscript™ III RT using the reaction conditions of Superscript™ II RT 
stand-alone, SUPERSCRIPT™ II First-Strand Synthesis System for RT-PCR, or 
ThermoScript RT-PCR System with a P-actin 353 bp target or with larger 
targets. 

[0413] A magnesium concentration of 3 mM was chosen for SUPERSCRIPT™ 

III RT as this the minimum concentration needed in the RT reaction and it was 
the same concentration found with the SUPERSCRIPT™ II RT stand-alone. 
With 0.5 mM dNTPs, optimal performance was seen with magnesium 
concentrations from 3 mM to 7 mM with both low RNA concentrations of 1 
pg, 100 ng, and 5 |ig. 

[0414] The pH of the 5X First-Strand Buffer will remain at 8.3 as this is the 

pH found in the Superscript™ II RT buffer and little difference was seen 
with different pHs at higher temperatures (Figure 14). 

[0415] RT reaction temperature. 200 units of LEFN (Superscript™ III RT) 

showed little reduction in product up to 50°C, which is 5°C higher than the 
capabilities of Superscript™ II RT (50 U), and 5°C lower than that of 
ThermoScript RT (15 U). Results were similar with both low levels of RNA 
and the p-actin 353 bp primer set, and higher levels of RNA and the TSC 5.3 
kb primer set. 

[0416] When a His-tag is part of the RT, slightly higher thermal-stability is 

noted. RT reactions containing 10 to 1000 ng of total HeLa RNA and 200 
units of Superscript™ II or LEFN RT were run at 45°C to 55°C for 50 
minutes in a hot start reaction. 2 \xl of the resulting cDNA products were then 
added to 50 jlxI PCR reactions containing the Pol e 6.8 kb primer set. Resulting 
PCR products were then run on a 0.8% agarose gel containing 0.4 mg/ml 
ethidium bromide. Not much difference was seen between 200 units of EFN 
(His-tag), EFN (no tag), and LEFN (His-tag) with the P-actin 353 bp primer 
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set with all three showing product up to 55 °C. However, with the Pol e 6.8 kb 
target, product was also seen up to 55°C with all three mutants, except EFN 
(no tag) showed less product at this temperature than either of the mutants 
with His-tags. 

[0417] The same temperature profile as found with oligo(dT) 2 o and the 

advantage of the His-tag was also seen with a gene-specific primer. RT 
reactions containing 1 to 10 ng of total HeLa RNA and 200 units of 
Superscript™ II, EFN, or LEFN RT were run at 45°C to 65°C for 50 
minutes in a hot start reaction. 2 jxl of the resulting cDNA products were then 
added to 50 jxl PCR reactions containing the CBP 1.6 kb primer set. Resulting 
PCR products were then run on a 0.8% agarose gel containing 0.4 mg/ml 
ethidium bromide. Using a gene-specific primer in the RT reaction, 200 units 
of EFN (His-tag), EFN (no tag), and LEFN (His-tag) all still showed CBP 1.6 
kb product up to 55°C, but EFN (no tag) showed slightly less than the others at 
55°C. 

[0418] In order to exploit the stability provided by the His-tag, LEFN with a 

tag minus the 6X His was designed. RT reactions containing 500 ng of total 
HeLa RNA and 200 or 400 units of SUPERSCRIPT™ II, EFN, or LEFN RT 
were run at 45°C to 65°C for 50 minutes in a hot start reaction. 2 jal of the 
resulting cDNA products were then added to 50 |il PCR reactions containing 
the BF 2.4 kb primer set. Resulting PCR products were then run on a 0.8% 
agarose gel containing 0.4 mg/ml ethidium bromide. Little difference was 
seen with the mutant when compared to both EFN (His-tag) and LEFN (His- 
tag), product was seen up to 60°C with all three with the BF 2.4 kb primer set 
(Figure 15). 

[0419] RT enzyme concentration. An experiment comparing units of EFN 

(His-tag) from 25 units to 250 units showed little difference in performance 
over the entire range, when looking at both sensitivity with the p-actin 353 
primer set or with longer targets. RT reactions containing 0.1 to 1000 ng of 
total HeLa RNA and 25 to 250 units of EFN were run at 45°C for 50 minutes 
in a hot start reaction. 2 fxl of the resulting cDNA products were then added to 
50 jxl PCR reactions containing the p-actin 353 bp, CBP 1.6 kb, TSC 5.3 kb, 
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Pol 8 6.8 kb, or APC 8.5 kb primer set. Resulting PCR products were then run 
on a 0.8% agarose gel containing 0.4 mg/ml ethidium bromide. 
[0420] 200 and 400 units of LEFN (tag, no His) showed slightly better 

performance than 50 units with low concentrations of RNA. RT reactions 
containing 0.1 to 100 pg of total HeLa RNA and 50 to 400 units of LEFN RT 
were run at 50°C for 50 minutes in a hot start reaction. 2 jal of the resulting 
cDNA products were then added to 50 jal PCR reactions containing the P-actin 
353 bp primer (not shown) or the GAPDH 532 primer set. Resulting PCR 
products were then run on a 1.5% agarose gel containing 0.4 mg/ml ethidium 
bromide. The higher RT units yielded slightly more PCR product under hot 
start conditions with both p-actin 353 bp (data not shown) and GAPDH 532 
bp (Figure 16). 

[0421] Priming with 01igo(dT) 2 o, Random Hexamers, and Gene-Specific 

Primers (GSP). Superscript™ III RT performed similarly to Superscript™ 
II RT when RT reactions were run with different priming methods. Oligo(dT) 
yielded the cleanest product and highest specific yield with GSP having high 
levels on non-specific products (at 55 degrees RT temperature) and random 
hexamers having high specificity but low yield. 

[0422] Comparison of reverse transcriptases from various sources. Using p- 

actin 353 bp primer set, SUPERSCRIPT™ III RT was again able to detect down 
to 1 pg of starting HeLa RNA. RT reactions containing 1 to 100 pg of total 
HeLa RNA were run with each RT using the reagents and conditions specified 
in each protocol. 2 jal of the resulting cDNA products were then added to 50 
jlxI PCR reactions containing the p-actin 353 bp primer set. Resulting PCR 
products were then run on a 1.5% agarose gel containing 0.4 mg/ml ethidium 
bromide. ImProm II RT and SensiScript were also able to detect down to 1 
pg, but StrataScript and OmniScript could only detect down to 10 pg, and 
PowerScript was unable to even detect 100 pg of starting HeLa RNA. 

[04231 With larger targets and higher concentrations of total HeLa RNA, 

Superscript™ III RT (LEFN tag, no His) performed significantly better than 
most of the competitors in relation to the RT-PCR product yield and length. 
RT reactions containing 10 to 1000 ng of total HeLa RNA were run with each 
RT using the reagents and conditions specified in each protocol. 2 \xl of the 
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resulting cDNA products were then added to 50 |il PCR reactions containing 
the CBP 1.6 kb, BF 2.4 kb, VIN 4.6 kb or 9.4 kb primer set. Resulting PCR 
products were then run on a 0.8% agarose gel containing 0.4 mg/ml ethidium 
bromide. CLONTECH PowerScript only showed product when the total 
HeLa RNA was 1000 ng, and the yield of specific products was not as high as 
with Superscript™ III RT. Qiagen SensiScript and OmniScript RT showed 
sufficient product yield with the smaller targets (CBP 1.6 kb and BF 2.4 kb), 
but were not able to detect the longer targets. Stratagene StrataScript was able 
to detect all the targets but with lower yields than SUPERSCRIPT™ III RT. 
Promega ImProm-II RT System showed performance similar to 
SUPERSCRIPT™ III RT, but did not have as high a yield with the longest target 
(FIB 9.4 kb). 

[0424] Additional reactions with lower starting HeLa RNA concentrations 

showed a similar pattern. RT reactions containing 10 to 100 ng of total HeLa 
RNA were run with each RT using the reagents and conditions specified in 
each protocol. 2 jal of the resulting cDNA products were then added to 50 |Ltl 
PCR reactions containing the CBP 1.6 kb, BF 2.4 kb, VIN 4.6 kb or 9.4 kb 
primer set. Resulting PCR products were then run on a 0.8% agarose gel 
containing 0.4 mg/ml ethidium bromide. CLONTECH PowerScript was 
unable to detect any of the targets with 100 ng or less of starting total HeLa 
RNA. Qiagen SensiScript was able to detect the VIN 4.6 kb target with this 
lower concentration of RNA when it had not been able to detect this target 
previously with 1 |ig of starting total HeLa RNA. Promega ImProm-II did not 
perform as well as Superscript™ III RT with lower concentrations of RNA. 
Superscript™ III RT was the only RT that was able to detect the FIB 9.4 kb 
target with 100 ng starting total HeLa RNA. 

[0425] RT-PCR analysis. Superscript™ II and Superscript™ III 

performance in hot start RT-PCR amplification at 42°C, 50°C, or 55°C were 
compared and the results are shown in Figure 17 SUPERSCRIPT™ II (Panel A) 
or Superscript™ III (Panel B). Lanes (in duplicate) 1, 4, 7, and 10 are 
products reverse transcribed at 42°C. Lanes 2, 5, 8, and 11 are products 
reverse transcribed at 50°C. Lanes 3, 6, 9, and 12 are products transcribed at 
55°C. Lanes 1-3 are the result of RNAs reverse transcribed by gene-specific 
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priming from FGF, lanes 4-6 CBS 2.4, lanes 7-9 from TOP 3.2, lanes 10-12 
VIN 4.6. Arrows indicate expected product sizes of 240 bp, 2390 bp, 3162 bp, 
and 464 lbp. Superscript™ II transcribed more robustly at 50°C whereas 
Superscript™ III best transcribes at 55°C. Amplification at 55°C for some 
applications that require higher annealing temperatures for gene specific 
priming and/or to remove secondary structure from an RNA template may be 
performed with the polypeptides of the invention. This will allow the 
reduction of non-specific priming during reverse transcription with gene 
specific primers and/or the increased reverse transcription of refractory 
templates. 

[0426] Superscript™ III RT: Two different purification techniques. 

Superscript™ III has been purified by two different methods (see above). In 
brief, Method A is similar to the purification technique used for 
SUPERSCRIPT™ II, Method B differs from method A by the use of a membrane 
permeabilization protocol and filtration protocol to reduce cellular debris 
which results in a high-purity preparation. RT-PCR was performed using 200 
U of Superscript™ III purified by Method A or Method B. RT-PCR was 
performed using the Pol 8 6.8 kb primers and p-act 353 bp primers. Product 
yield and quality were compared for both purification methods. The methods 
yield similar results and either are viable methods for purification of 
Superscript™ III. 

[0427] Optimum SUPERSCRIPT™ III RT enzyme concentration. RT reactions 

containing 0.1 pg to 1000 ng of total HeLa RNA and 25 to 250 units of 
Superscript™ II, EFN, or LEFN RT were run at 45°C for 50 min (hot start). 
2 |il of the resulting cDNA were then added to 50 (il PCR reactions containing 
the p-actin 353 bp, CBP 1.6 kb, TSC 5.3 kb, Pol e 6.8 kb, or APC 8.5 kb 
primer set. Resulting PCR products were then run on a 0.8% or 1.5 % agarose 
gel containing 0.4 |ag/ml ethidium bromide. No significant differences was 
obtained from 25 units to 250 units of SUPERSCRIPT™ III RT. (Fig. 18). 
However, higher amount of SUPERSCRIPT™ III RT (400 units) shows higher 
RT-PCR product yield at an elevated RT temperature (55 degrees, Fig. 19). In 
Fig. 19, RT reactions containing 10 to 1000 ng of total HeLa RNA and 200 or 
400 units of SUPERSCRIPT™ II or LEFN RT were run at 50°C to 60°C for 50 
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min (hot start). 2 ^1 of the resulting cDNA were then added to 50 jal PCR 
reactions containing the Pol s 6.8 kb primer set. Resulting PCR products were 
then run on a 0.8% agarose gel containing 0.4 |ig/ml ethidium bromide. 
Unlike Superscript™ II RT, increased amount of Superscript™ III RT (up 
to 400 units) did not inhibit subsequent PCR amplification (Fig. 20). In Fig. 
20, RT reactions containing 0.1 to 100 pg of total HeLa RNA and 50 to 800 
units of LEFN RT were run at 50°C for 50 min in hot start conditions 
(Oligo(dT) 20 ). 2 |il of the resulting cDNA were then added to 50 PCR 
reactions containing the GADPH 532 bp primer set. Resulting PCR products 
were then run on a 0.8% agarose gel containing 0.4 |ag/ml ethidium bromide. 
[0428] Superscript™ III RT has been engineered to increase the temperature 

at which the enzyme can perform RT activity. The enzyme has been 
optimized for RT-PCR for sensitivity, yield, and target length. With this 
optimized system, it was demonstrated that Superscript™ III RT was able to 
detect mRNA targets with as little as 1 pg of starting total HeLa RNA and to 
produce high yield cDNAs from targets up to 12.3 kb in size. It has the ability 
to function at elevated temperatures up to 55°C and to detect a wide variety of 
targets. The performance of this enzyme is superior to any other RTs 
commercially available, and the optimized buffers and RT protocol provide a 
sensitive and robust RNA detection system. 

EXAMPLE 9 

Use of polypeptides of the invention to prepare labeled nucleic acid molecules. 

[0429] Polypeptides of the invention can be used to prepare labeled nucleic 

acids from a variety of templates (e.g., total RNA, mRNA, etc.). For direct 
labeling using a polyA tailed RNA template, suitable reaction conditions may 
entail the use of from about 1 \xg to about 1000 |4,g, from about 1 jag to about 
500 |tig, from about 1 \ig to about 250 from about 1 \xg to about 100 |xg, 
from about 10 (ig to about 1000 fig, from about 10 jig to about 500 jag, from 
about 10 |Lig to about 250 ^ig, or from about 10 jag to about 100 ^ig of RNA. 

[0430] RNA can be mixed with a suitable primer (e.g., an oligo dT primer or a 

gene specific primer). After mixing, the template primer mixture can be 
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incubated at a suitable temperature (e.g., 70°C for an oligo(dT) 25 primer) and 
incubated for a suitable period of time (e.g., 5 minutes). Those skilled in the 
art can readily determine incubation times and temperatures for any template 
primer pair using routine experimentation. The mixture may then be chilled 
on ice and the remaining reaction components added. 
[0431] Suitable reaction components, which may be provided in a reaction 

mixture and/or solution either individually or in combination, include a 
buffering agent, reducing agent, one or more nucleotides, at least one of which 
may contain a label (e.g., a fluorescent label), one or more polypeptide of the 
invention, and suitable, diluent (e.g., H 2 0). A suitable reaction mixture can be 
prepared by combining 4 jul 5X first strand buffer, 2 |^1 0.1M DTT*~1 jal 10 
mM dNTP, 2 jal ImM Fluorescent dNTP, 2 (al SUPERSCRIPT™ III (200u/^il), 
an aliquot containing RNA, and dH 2 0 to 20 Additional reaction mixtures 
and/or solutions, as well as components thereof, which may be used with this 
aspect of the invention are described elsewhere herein. The mixture may be 
incubated at a suitable temperature, for example, from about 42°C to about 
60°C, from about 45°C to about 60°C, from about 48°C to about 60°C, from 
about 50°C to about 60°C, from about 52°C to about 60°C, from about 55°C 
to about 60°C, from about 42°C to about 55°C, from about 45°C to about 
55°C, from about 45°C to about 50°C, from about 45°C to about 48°C, from 
about 48°C to about 60°C, from about 48°C to about 55°C, from about 48°C 
to about 52°C, from about 50°C to about 60°C, from about 50°C to about 
55°C, or from about 50°C to about 52°C. The mixture may be incubated until 
a sufficient incorporation of label is seen, for example, from about 5 minutes 
to about 24 hours, from about 10 minutes to about 24 hours, from about 30 
minutes to about 24 hours, from about 1 hour to about 24 hours, from about 2 
hours to about 24 hours, from about 4 hours to about 24 hours, from about 8 
hours to about 24 hours, from about 30 minutes to about 16 hours, from about 
30 minutes to about 8 hours, from about 30 minutes to about 4 hours, from 
about 30 minutes to about 2 hours, from about 30 minutes to about 1 hour, 
from about 1 hour to about 4 hours, or from about 1 hour to about 2 hours. 
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[0432] The reaction may be stopped, for example, by addition of a suitable 

stopping reagent (e.g., 5 |a1 of 0.5M EDTA in a 20|al reaction). The resultant 
labeled nucleic acids may be purified using any standard technique (e.g., 
column purification using a SNAP column, Invitrogen Corporation, Carlsbad, 
CA). 

[0433] Labeled nucleic acid produced by the methods of the invention may be 

used for a variety of purposes, for example, to detect a target sequence, which 
may be present on an array. Typically, labeled nucleic acid of the invention 
may be hybridized to one or more target sequences (e.g., a microarray). 

[0434] In some embodiments, polypeptides of the invention may be used to 

prepare labeled nucleic acids by indirect labeling. For example, a modified 
nucleotide (e.g., an amino allyl nucleotide) may be incorporated into a nucleic 
acid molecule using the polypeptides of the invention. The nucleic acid 
molecules containing the modified nucleotide may then be reacted with a 
reactive molecule that comprises a detectable moiety (e.g., a fluorescent 
moiety, radiolabel and the like). All or a portion of the reactive molecule and 
the detectable moiety may then become attached to the nucleic acid molecule. 

[0435] Suitable reaction conditions for indirect labeling include those listed 

above. For example, RNA (e.g., 5-50 jig) can be mixed with a suitable primer 
(e.g., an oligo(dT) 2 5 for polyA tailed RNA), mixed and incubated, for 
example, at 70°C for 5 min, and then chilled on ice. To the primer: template 
mixture, addition reaction components may be added. For example, for a 30 
\x\ reaction volume, suitable component may include: 6 |xl 5X first strand 
buffer, 1.5 jal 0.1M DTT, 1 \x\ RNaseOUT (40u/|al), 1.5 |al 10 mM dNTP 
mixture containing 10 mM dATP, 10 mM dCTP, 10 mM dGTP, 4 mM dTTP, 
and 6 mM Aminoallyl-dUTP), 2 ^1 Superscript™ III (200u/jil), X jal dH 2 0 
to a total volume of 30 The mixture may be incubated at a suitable 
temperature for a suitable time such as those described above, for example, at 
50°C for 2 hours. The mixture may be treated to degrade the RNA template 
(e.g., by the addition of 15^1 IN NaOH, and heating to 70°C for 10 min). The 
pH of the solution may then be adjusted (e.g., by the addition of 15 |xl IN 
HC1) and the nucleic acid containing the modified nucleotide may be purified 
by standard techniques (e.g., using a SNAP column, Invitrogen Corporation, 
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Carlsbad, CA). The purified nucleic acid may be concentrated (e.g., by 
ethanol precipitation), 

[0436] After ethanol precipitation, the nucleic acid containing the modified 

nucleotide may be resuspended in a buffer suitable for coupling the reactive 
molecule containing the detectable moiety to the nucleic acid. For example, 
the nucleic acid may be resuspended in 5 \xl coupling buffer (e.g., 0.1 M 
sodium borate, pH 8.5). The reactive molecule (e.g., a molecule comprising a 
dye and a reactive functionality) may be added (e.g., 5 |xl mono functional Cy3 
or Cy5 dye from APB Cat#PA23001 and # PA25001). When the reactive 
molecule is a dye, the dye may be suspended in any suitable solvent (e.g., 
DMSO). In one embodiment, 1 pack of CyDye was resuspended in 45 (4,1 
DMSO and used as above. The reaction may be incubated for a suitable 
length of time and at a suitable temperature, for example, at room temperature 
for 1 hour. When the dye is light sensitive, the incubation is preferably 
performed in the dark. The coupling reaction is stopped by the addition of a 
reagent such as 5 jal of 4M hydroxylamine. The labeled nucleic acid is then 
purified (e.g., using a SNAP column, Invitrogen Corporation, Carlsbad, CA). 

[0437] Labeled nucleic acids (whether prepared by direct or indirect labeling) 

may be hybridized to one or more target sequences, which may be in any 
suitable form, for example, in a microarray. The labeled nucleic acids are 
denatured (e.g., by heating at 95°C for 2 min). The denatured, labeled nucleic 
acids are brought into contact with the target sequences in a suitable buffer. 
For example, a microarray slide can be placed in a hybridization buffer (e.g., 
25 % Formamide, 5 X SSC, 50 mM MES pH 6.5, 0.1% SDS) that contains 
labeled nucleic acid (e.g., -1 jag of Dye-label probe). The microarray can be 
incubated for a suitable time at a suitable temperature, for example, at 42°C 
for 16 hours. The microarray may then be washed one or more times, for 
example, the hybridization solution may be removed and the microarray 
washed in Wash I (5 X SSC, 0.2% SDS), followed by a wash in Wash II (1 X 
SSC, 0.2% SDS) at 42°C for 5 min, followed by a wash in Wash III (0.1 X 
SSC) for 2 min. The microarray may be dried, for example, by spin drying the 
microarray at 600 rpm for 5 min. 
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[0438] The hybridized microarray may be analyzed using any techniques 

known in the art. 

[0439] All publications, patents and patent applications mentioned in this 

specification are indicative of the level of skill of those skilled in the art to 
which this invention pertains, and are herein incorporated by reference to the 
same extent as if each individual publication, patent or patent application was 
specifically and individually indicated to be incorporate by reference. 



